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Abstract. Global existence and long-time behavior of solutions to a family of nonlinear fourth 
order evolution equations on R, d are studied. These equations constitute gradient flows for the 
perturbed information functionals 

i r .o a c 

Ax 



,(u) = ^-[ \D(u a )\ 2 dx+^ [ \x\*uda 
2a J R d 2 J R d 



with respect to the L 2 -Wasserstein metric. The value of a ranges from a = 1/2, corresponding 
to a simplified quantum drift diffusion model, to a = 1, corresponding to a thin film type 
equation. 



1. Introduction and Main Results 

This paper is concerned with non-negative solutions to the following family of nonlinear fourth 
order parabolic problems in (0, +oo) x R d , 

(1.1) d t u = -div (uB [/"'Am ]! + \div(xu) = 0, 

(1.2) u(0,x) = u {x) > 0, / (l + \x\ 2 )u (x)dx < +oo. 

For the exponent a in (|1.1[) we consider values 1/2 < a < 1, and we assume A > 0. This class of 
equations has recently been considered by Denzler and McCann, who constructed special solutions 
[21]. However, apart from the special cases a = 1/2 and a = 1, which are described in some detail 
below, analytical results for general solutions to (|1.1| do not seem to have been available until 
now. The current paper provides proofs for the existence of weak solutions and their long-time 
behavior under mild assumptions on uq. 

Solutions to (jl.ip can be interpreted as nonlinear (confined) diffusion processes. In fact, our 
results show remarkable similarities between (|l.lj) and the well-studied second-order (non-)linear 
Fokker Planck equations [39] . 

(1.3a) d t v = e a A(v a+1/2 ) + A aA div(xv) if a > 1/2, 

(1.3b) d t v = -Av + Ai /2iA div(xv) if a =1/2, 

with parameters defined by 



(1.4) e « : =T-^ A Q , A :=v^C 5 a :={2a-l)d + 2. 

2a + 1 

The first similarity is that both equations and (|1.3|) admit non-negative global weak solutions 
u and v, respectively. The preservation of non- negativity by solutions to (jl.ip is a remarkable fact 
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in its own right, since maximum principles do not generally apply to fourth order equations. 
Moreover, the integral (total mass) 



(1.5) m = m(u) := / udx 

Jn d 

is preserved in time for both equations. It is no loss of generality to assume unit mass m = 1 in 
the following. 

Further, as is characteristic for diffusion processes, both solutions u and v dissipate a variety of 
entropy functionals, like the perturbed entropy H a \ and perturbed information F a ^\ introduced 
in (|1.8p - (|1.9b|) below. For A > 0, this leads to the convergence of u and v to stationary solutions 
Moo and Voo in the limit t —> oo. The attained profiles and Vqo are independent of the initial 
conditions uq and vq, respectively. 

In fact, the most striking similarity of (|1.1|) and (|1.3p . first observed in [21], is that the stationary 
solutions jtoo and are identical. They are Barenblatt profiles or Gaussians, respectively, 

(1.6a) b aA .. m (x) = (a-bHV /M/2) , b=^-i^A Q , A if a > 1/2, 

T V2a 



(1.6b) h/2,X;m(x) = a exp ( - A 1/2jA \x\ 2 ) if a = 1/2. 

Above, the positive parameter a is chosen to adjust the mass m to unity. 

One of the goals of this paper is to provide an explanation of the aforementioned similarities be- 
tween the fourth and the second order equations. By exploiting the gradient flow structure of 1|) . 
these similarities turn out to reflect a natural correspondence between entropy and information 
functionals. 

Fourth order equations and gradient flows. The family fll.ip of nonlinear fourth order equa- 
tions possesses a gradient flow structure [H] . This fact has been made use of by Giacomelli and 
Otto [34] [25] in the case a = 1, and by Gianazza, Savare and Toscani [24] in the case a = 1/2, 
but it seems not to have been exploited previously for the intermediate range 1/2 < a < 1. 

The suitable metric space is that of probability measures on R d , endowed with the L 2 -Wasscrstcin 
distance. Observe that equation (|1 . ip is formally equivalent to the canonical form of a Wasserstcin 
gradient flow [3] Ex. 11.1.2] 

(1.7) d t u + dw(uv)=o, v = -D SFa : x{u) , 

ou 

where F a ^\ is the perturbed information functional defined for smooth positive densities as 



(1.8) 



F a ,x(u) :~ f \V(u a )\ 2 dx + ^ [ \x\ 2 udx 
2a J R d 2 J Rd 



and SF a ,\/Su denotes its Eulerian first variation, 

= -V- (au 2a - 2 D u) + a(a ~ l^-'lD uf + -\x\ 2 = -u^ 1 A(u a ) + -\x\ 2 . 
ou 1 1 2 2 

We remark that the range 1/2 < a < 1 corresponds to the range of convexity of F a ^\ with respect 
to linear interpolation of measures. On the other hand, we emphasize that even in this range of 
a's, the functionals F a .\ are not geodesically convex (= displacement convex); see Carrillo and 
Slepcev [13j . The latter fact makes it impossible to apply the standard machinery for metric 
gradient flows developed e.g. by Ambrosio, Gigli and Savare [S] in a straight-forward manner. 

The information establishes the link between ()l.ip and the second order diffusion equation 
Namely, consider in addition the perturbed entropy functionals 

O a / f ™ i 1 In , , x^v i 1 /o\ A c 



(1.9a) Ha . x (u) = -^-( u a+1 / 2 dx-m(ur +1 / 2 )+^ \x\ 2 udx if a > 1/2, 
a - 1/2 \J Rd J 2 J Rd 

(1.9b) H 1/2jX (u) = \( f ulogudz-m(u)log(m(u))) + [ \x\ 2 udx if a = 1/2, 
l \ J Rd J I J R d 



A FAMILY OF NONLINEAR FOURTH ORDER EQUATIONS 



3 



which Otto showed generate (|1.3|) as gradient flow with respect to the L 2 -Wasserstein- metric [35] . 
Define the associated entropy production as the time derivative of H a \ along any sufficiently 
regular solution v(t) to (| 1 . 3|) . The latter amounts to 

(1.10) -^H a , x (v(t)) = F a ,x(v(t)) - (5 a - 2)AH a , x (v(t)). 

The special form of p.lO|) . as well as the choice of A and 8 in (|1.4|) . is related to the rescaling 
properties of the unperturbed functionals F a fi and H a o with respect to the "mass preserving 
dilations" 

(1.11) D r u(.) := r^uir' 1 ■) r > 0, 
which satisfy 

(1.12) F a! o(p r u) = r- 5 «F{u), H ai0 (X> r u) = r { ~ K+2)/2 H a . a {u) Vr > 0. 

The connection between (|1.12[) and (|1.10[) are clarified in Remark 1 2. 11 

Relation ()1.10p is at the very basis of our investigations of the long-time behavior of solutions 
u(t) to (|1.1|) : as H a \ is a geodesically A-convex functional [32], it generates a A-contractive 
gradient flow (|1.3[) . as in Ambrosio, Gigli and Savare [3J, Carrillo McCann and Villani [TJ], and 
Sturm and von Renesse [35] [SZ] ; after work of Otto [35] . By means of (jl.lOp , attraction towards 
the fixed point is inherited by the gradient flow (jl.ip . regardless of the fact that its generating 
functional F a ^\ is not geodesically convex. The respective proof, in a very general setting, is 
contained in section [3] 

The limiting cases: DLSS and thin film equation. Another motivation for studying the 
family originates from the equations obtained for a — 1/2 and a — 1. When a = 1/2 and 
A = 0, the respective entropy (|1.9bj) turns into (half) the Boltzmann functional, 

1 

2~w> ~ v -, ■ y Rd Vm(it)^ 
and the information coincides with the classical Fisher information, 



(1.13) H 1/2fi [u] = -H(u), H(u) := / u log ( ) dx, 



(1.14) fi/2,o(«)=/ |DV^| 2 dx=i/ t^-dx. 

Jr* 4 J Rd u 

The minimizers of the associated A-perturbed functionals are Gaussians (jl.6bp . For the respective 
unperturbed gradient flow, one recovers the Derrida-Lebowitz-Speer-Spohn equation 

(1.15) d t u + diy(uT>^0-) = 0. 

Originally, those four authors derived (|1.15p in one spatial dimension on the half-line R + as a 
description for interface fluctuations in the Toom model [22] [23]. Later, this equation was re- 
discovered in the mathematical theory of semiconductors. In three spatial dimensions, equation 
(j!.15|) constitutes the low-temperature, field-free limit of a simplified quantum drift diffusion 
system for the electron density u; see [31] and references therein. Existence of solutions to the 
initial value problem for 15[) has been studied by Bleher, Lcbowitz and Speer [5J, Gianazza, 
Savare, and Toscani [21], and Jiingel with Pinnau [3T] and with Matthes [2H]. While still little 
is known about the qualitative properties of general solutions (strict positivity is essentially an 
open problem, as is uniqueness), a variety of results on the rates for relaxation to equilibrium are 
available [30] . 

On the other hand, the parameter a = 1 corresponds to 

Hi, (u) = ^( [ U 3/2 da;-m( U ) 3 / 2 Y F 1>0 (u) = \ f \Bu\ 2 dx, 

6 \J R d J A J R d 

respectively, and their associated A-perturbations. Notice that F\fi(u) is the Dirichlet energy of 
u. Both -ffi.o(') and -Fi,o( - ) are minimized by the Smyth-Hill profiles 

&i,A ; m(^) = 37-5 — 7^{p 2 — |^c| 2 ) ^ , p > being chosen to adjust the mass m to unity. 
8(u + 2) + 
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Equation turns into the following particular thin film equation, 

(1.16) d t u + div(uBAu) = 0. 

The degenerate parabolic fourth order thin film (or lubrication) equations of the more general 
form 

(1.17) d t u + div(m(u)DAu) = 

with an increasing, non-negative mobility function m have been extensively studied in the math- 
ematical literature since the seminal paper of Bcrnis and Friedman [B]. The theory of existence 
of weak solutions to (|1.17p in dimensions d = 1, 2, 3 with physically relevant mobility functions is 
quite complete, see Bertsch, Dal Passo, Garcke and Griin [7] and the references there. Proper- 
ties of the solution, in particular the rate of convergence to the equilibrium state, the spreading 
behavior of the support, and waiting time phenomena have been heavily investigated in a series 
of publications; see e.g. [IBJ [3BJ and the review by Becker and Griin [BJ. However, fundamental 
questions — like the rupture of film for specific mobility functions — still remain largely open, 
even in one spatial dimension. 

The particular equation (|1.16p . which is (|1.17|) with the linear mobility function m(£) = £, 
plays a distinct role in the theory, at least in d = 1 spatial dimension. The equation generates the 
so-called Hele-Shaw flow, which describes the pinching of thin necks (with thickness u) in a Hclc- 
Shaw cell. The mathematical literature on problems related to the Hele-Shaw flow is extensive; 
an overview over classical results can be obtained e.g. in Myers |33j . Analytical treatment in 
the spirit of our investigations is found in Almgrcn, Bcrtozzi, and Brenner [TJ, Otto [34] . and 
particularly in Carlen and Ulusoy [10] and Carrillo and Toscani [15] . 

Main results. The results of this paper are twofold. The first concerns the existence of global 
weak solutions to the initial value problem (|l.l[) fc (|1.2p . 

In order to formulate the existence theorem, we introduce our notion of weak solution. Observe 
that the non-linearity q inside the first divergence operator of (jl.ip can be formally rewritten as 

(1.18) q = uD^-'Au" 1 ) = D (u a Au a ) - u"' 1 (Au a ) T) u = D (u a Au a ) - a _1 (Au a ) D u a . 
This expression has a (distributional) meaning even if a = u a € W^(R d ) only. 
Theorem 1.1. Assume that the non-negative initial condition uq G L 1 (R d ) satisfies 

(1.19) / \x\ 2 uq(x) dx < +oo, H(uq) = / uq(x) log (uq(x)) dx < +oo. 
jR d Jn d 

Then there exists a non-negative global solution u € C°([0, +oo); L 1 (R rf )) with u(0) = uq and 
u a <G Lf oc ([0, +oo); M /2 ' 2 (R d )), satisfying the initial value problem (|l.ip & ([1.2p in the following 
sense 

(1.20) d t u + div (^D(u a Au a ) -a' 1 (A«")Dm° - Xux^j =0, 
i.e., for every test function £ € C^°((0, +00) x R d ) one has 

(1.21) / / (~d t ( + \x-D()udxdt+ [ [ Au a (u a AC + a _1 Dit a -DC) dxdt = 0. 

JO JR d Jo JR d V ' 

Remark 1.1 (Weaker formulation). An even weaker form of (jl.ip exists, which just requires 
a = u a G Ll oc ((0, +00); Vt /1 ' 2 (R d )). Indeed, observe that the vector q defined in (|1.18p has 
components (repeated indices are summed) 

q t = di(o-d] j( x) - a-^dio-^a = - d^af - a^d^d^a) + a" 1 ^ 2 a)d,a 



The respective weaker formulation of (|1.20p reads for A = as 



(1.22) dtU+ l A \a 2 )-(l-^) A\Da\ 2 -^df j (d i ad j a) = 0, o = 
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This corresponds to the integral condition 

(1.23) / / (-d t ( + \x-D()udxdt + / N a {u;D()dt = VC G C o °°((0, +00); R d ), 
upon defining for it E W 1,1 (R d ) with cr = u a e W h2 (R d ) and C = DC £ C °°(R d ;R d ) 

(1.24) iV Q (w;C):= — -/ fa Da 2 ■ DdivC + (2a - 1)1 Deri 2 divC + 2DCDcr • Da) Ax. 

2a J Rd \ ) 

It is not difficult to check that (fl~2"4"l) and (fl~2"i) are equivalent if a = u a £ L\ oc {{®, +00); W 2 ' 2 (R d )). 
Although the link of (fTTj) to (fl~2"2"f is less evident than to (|L"2"0"]) . the weaker form (fL2"2"]) is well- 
suited to our analysis for several reasons: first. (|1.22|) is the natural variational formulation of the 
gradient flow equation that is obtained by the method of Jordan, Kinderlchrer and Otto [27], see 
Lemma \2.5l second, the pointwise condition a = u a <E W 1,2 (R d ) is satisfied as soon as i^^fit] is 
finite; third, strong convergence of a = u a in L\ oc {(0, +00); W 1,2 (R. d )) is sufficient to pass to the 
limit in the nonlinear operator N a . In particular, the last point is important for the passage to 
the continuous time limit in the minimizing movement scheme introduced in the next paragraph. 

The second result concerns the convergence of u in the long-time limit. 

Theorem 1.2. If A > then the weak solutions u to the initial value problem (|l.ip & (|1.2|) con- 
structed as the small time-step limit of the minimizing movement variational scheme (|1.32p ( which 
in particular satisfy the properties stated in Theorem \1.1\) converge exponentially fast towards the 
respective Barenblatt profile b a! X;m * n ^(R )> 

(1-25) || u(t)-b a! x- m \\ Ll(Rd) < C (H a , x (uo) - H a ^(b a ^, m )) 1/2 e- xt , 

where the constant C > only depends on a, A,(i, m. Moreover, the entropy and the information 
functionals H aj \,F aj \ decay along this solution at an exponential rate, 

(1.26) H a ^(u(t)) - H a ^(b a , x , m ) < (H a , x (u ) - H a , x (b a , x , m ))e' 2M , 

(1.27) Fa,x{u{t)) - F a:X {b a ,x:m) < (F a ,x{u ) - F a:X (b a ,x:m))e~ 2Xt . 

We refer to section [S] for more details, and also an improved convergence result in W 1,2 (R d ) 
for the "thin film" case a = 1. 

In the unconfined case A = 0, there exists no non-trivial stationary solution, because u(t) tends 
to zero in Lj oc (R d ) as t — * 00. However, we are able to describe the intermediate asymptotics: u 
approaches the self-similar spreading Barenblatt profile 

(1.28) b afi;m (t, ■) := fl(t)-''&a,lim(fl(t)" 1 ■) = £>fl( t ) Vi;m, R(t) := (l + (S + 2)t) V(5+2) , 

in L 1 (R rf ) at an algebraic rate; see Corollary 15. 51 Thanks to the scaling invariancc (|1.12|) of equa- 
tion Ql.ip , the latter result can be recovered from p.25[) . The corresponding rescaling argument is 
by now classical, see e.g. Dolbeault and del Pino [TH]. In Section [S~3l we will present a variational 
derivation of it, that is entirely based on the invariance property p,12p . In the particular case 
a = 1 of the thin film equation, it leads the following result: 

Corollary 1.3. Assume a = 1 and A = 0. Then any weak solution u to the initial value problem 
(|l.ip & (|1.2[) constructed as limit of the minimizing movement variational scheme satisfies 

(1.29) \\u(t, ■) -&i,o ;ro (f, OlliW) ^ CR{ty\ || D«((, ■) -Db lfi;m (t, OHl^r-) < CR{t)- 2 - d / 2 . 

Variants of the estimates in Theorem 11.21 are already known is two special cases. For the 
situation a = 1/2 of the Derrida-Lebowitz-Speer-Spohn equation (|1.15p , results equivalent to 
those of Thcorcm ll.2l have been recently obtained by Gianazza, Savare and Toscani [55]. Moreover, 
alternative estimates on entropy and information decay are available for the Helc-Shaw flow 16)1 
in dimension d = 1, 

(1.30) d t u = -{uu xxx ) x + (xu) x . 

In [H] , entropy decay for solutions to (|1.30p — at the same exponential rate — has been obtained 
for strong solutions u. Instead of resorting to the general formalism of Wasserstein gradient 
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flows, Carrillo and Toscani rewrite (| 1 .30(1 in an ingenious way, so that all estimates can be made 
completely explicit. As we show in Section [3TU their appealing formal idea can in fact be extended 
to the whole family of equations (jl.ip with a > 1/2, and to arbitrary dimensions d > 1. 
Finally, we mention that the behavior of the perturbed information 

^i,iM = q / | (•■£)! 2 + — [ x 2 u(x)dx 

2 JR 2 JR 

along solutions to (|1.30[) was studied recently. By proving asymptotic cquipartition of kinetic 
and potential energy, an algebraic decay estimate was derived by Carlen and Ulusoy |10j . and 
subsequently improved by Carlen [3] , leading to the same exponential equilibration behavior that 
we find. 

The discrete variational scheme. In order to prove Theorem ll.il we closely follow the strategy 
introduced by Jordan, Kinderlehrcr and Otto in the framework of the Wasserstein space, and 
further developed in the book of Ambrosio, Gigli and Savare [3]. After mass renormalization 
m = 1, we associate to the time dependent density function u(t) satisfying (JTTTJ) the (absolutely 
continuous) probability measures fi(t) := u(t, -)L d in R d . The quadratic moment of /j,(t) is finite 
for all times t > if it is initially at t = 0. Hence /i attains values in the Wasserstein space ^(R^), 
metrized by the L 2 - Wasserstein distance W%{-, •); we recall the relevant definitions in sectional 

A time-discrete approximation of solutions to (|l.l|) fe (|1.2j) is constructed by application of Dc 
Giorgi's "minimizing movement" scheme to the information functional Ja^/i] = F at \(u) (defined 
on Borel probability measures) . We review this scheme below; see section 13.21 for a more detailed 
description. 

Let a partition JV of the time interval [0, +oo) be given, 

(1.31) 3> T := {0 = f° <£<■■■<£<•■■ }, r„ := i™ - t™" 1 , lim £ = V r„ = +oo. 

n 

Associated to the step sizes r = (r n ) ne N and the initial measure = u$L d G ^(R^), we 
consider the sequence (M") n£ N recursively defined by solving the following variational problem 
in ? 2 (R d ): 

(1.32) find M" G Cp2(R d ) which minimizes the functional M h-> — ^(M™" 1 , M) + J Q \ [M] . 

The measure M™ G J ) 2 at the nth time step — which serves as approximation of /i(i") = u(t™)L d 
— is thus determined as the minimizer of a variational problem involving the Wasserstein distance 
and F Q;A - 

Following this scheme, one obtains a family of piecewise constant approximating solutions 
M T (t) = M™ in each interval (t£ , £"], satisfying a suitable discrete version of (|1.22|) . We prove 
that in the limit as sup„ r„ J, 0, one recovers a weak solution fi = uL d of (|1.20|1 . Since the 
functional F a .\ is not displacement convex, we cannot follows the standard procedures developed 
in [S] . Strong a priori estimates are needed to pass to the limit and to characterize the equation 
satisfied by the limit function. 

Discrete dissipation estimates via auxiliary gradient flows. The standard strategy for 
deriving these kinds of estimates is to get as much information as possible from the discrete 
variational problems (|1.32|) . usually by studying the first variation of the functional to be minimized 
under suitable perturbations of the minimizer. The careful choice of such perturbations therefore 
plays a crucial role. 

In the present paper we propose a general strategy to find a sufficiently wide family of per- 
turbations and to obtain corresponding discrete estimates, even in a general metric setting. The 
underlying philosophy is simple: we perturb the minimizer M" of (|1.32p by moving it along the 
gradient flows S v generated by other, simpler functionals V defined on ^(R-^)- In order to obtain 
useful information, V should be ^-displacement convex for some k £ R, so that S v is ^-contracting 
and can be characterized by a suitable variational inequality, see Theorem 12 .41 and ()3.16j) . Assume 
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k > temporarily, for ease of presentation. Assume further that the dissipation of 3 at \ along the 
flow S v , 

(1.33) D J a ,xN : = hmsup — '■ — '■ 5 — , 

hio n 

is non-negative, possibly up to lower order terms. Then, the discrete solution M" satisfies the a 
priori bound 



A ■ 



(1.34) V[M£] + t„ D v J Q A [M™] < V^/™" 1 ] or, equivalently, — ^ !— ^ > D v J Q 

See Theorem 13.21 for the complete statement, with k € R. Wc shall refer to inequality (|1.34j) - 
or rather to its generalized form (|3.22p — as the "flow interchange estimate" for the following 
reason. The rate of dissipation r~ 1 (V[M™ _1 ] — V[M"]J of the functional V along the "discrete 
flow" generated by J a ,x through the minimizing movement scheme (|1.32p is bounded from below 
by the rate of dissipation D v 3 a ,X of $ a ,x along the continuous flow generated by V. It has the 
advantage of converting the differential estimate of 7 a ,x along a known evolution (|1.33|) into a 
discrete estimate for the approximation scheme of the flow for (|1.1[) . 

In the present paper we will repeatedly apply this principle with various choices of V: 

1. In Section \Hn\ and Lemma \JJH[ V[/x] = J Rd (d/j>, for ( € Cg°(R d ), generating the flow associated 
to the linear transport equation 

(1.35) dtv-dwivBC) = 0. 

This first step (which goes back to the original ideas of [27J ) shows that M™ is a weak solution 
of a discrete version of (|1.22p . 

2. In Lemma \2.6\ and equation (| 4 . 9 1) / V[fj] = m-2 [m] : ~ Jud \x\ 2 dfj,, corresponding to the dilation 
equation 

(1.36) d t v - div(vx) = 0, Sjv(x) = e^v^x) = D e -tV (x). 

It provides a control on the quadratic moments of the solution in terms of the behavior of J a ,X 
with respect to the dilations group Sj , sec (|1.12p . 

3. In Theorem \4-3\ = Jf[/i], the logarithmic entropy functional from (|1.13p . generating the 
heat flow 

(1.37) d t v-Av = 0. 

It provides the discrete prototype of the following a priori estimate for solutions u to (jl.ip , 

(1.38) H(u(T,-)) + c [ [ \B 2 a(t,x)\ 2 dxdt<H(u ) VT > 0; c := ^~\ v 

Jo Jr* ad{d + 2) 

This estimate is crucial for the passage to the limit in the nonlinear discrete equation. With a 
different technique, an analogous idea has been exploited in |24j . In order to capture the full 
power of the estimate hidden in the term D 5F aj A) which controls the second derivatives of u a , 
we will use in a very careful way the integration by parts rule proved by Gianazza, Savare and 
Toscani [21] and by Jiingel and Matthes [H] (see also [2H] for the development of an algebraic 
technique for dealing with such formulas). 

4. In Theorem \5.1[ V[fi] = ^K a .\[fj], the power entropy (|1.9ap . generating the porous medium flow 
(ll.3ap . It provides all the asymptotic estimates on the exponential decay of 3~C Qi a and $ a ,x 
of Theorem 11.21 This result is actually more subtle, since it also uses the particular relation 
(jl.lOp between information function F a .\ and the associated entropy H a .\. 

Open problems. The restriction to the range 1/2 < a < 1 — although natural for several 
reasons — is seemingly non-optimal with respect to where existence and equilibration results can 
be proven. The authors suspect that all of the aforementioned results extend mutatis mutandis 
at least to 

^ kr <a< 2- 



8 



DANIEL MATTHES, ROBERT J. MCCANN, AND GIUSEPPE SAVARE 



The reason is that on this wider range an a priori estimate of the type (|4.11j) continues to hold 
for sufficiently regular solutions. 

Still, even the wider range (|1.39p constitutes a rather unexpected restriction. In view of the 
properties of the intimately connected slow and fast diffusion flows, the natural lower limit for a 
should be a = 1/2 — 1/d (which is when the entropy H a \ loses geodesical convexity), while there 
should not exist any upper limit at all. 

Plan of the paper. In Section [2] below we collect some essential facts on the I/ 2 -Wasserstein- 
metric, its link with the entropy functionals (jl.9a[ b) and their gradient flows, and the related 
properties of the information functional (|1.8|) . 

As explained in the previous paragraph, Section [3] is devoted to the core ideas of obtaining 
dissipation estimates by using auxiliary gradient flows; its first part presents the basic heuristics, 
deriving the estimates in the elementary case of smooth flows in the Euclidean space R m . The 
general abstract results in metric spaces are developed in 13.21 

The proof of the existence result Theorem 11.11 based on the convergence of the minimizing 
movement scheme (|1.32[) . is carried out in Section [3J Besides the general ideas we have just tried 
to explain, the crucial estimate and many technical steps are contained in Lemma 14.41 

The large-time asymptotic estimates from Theorem 11.21 are derived in Section as a direct 
application of the results of Section [3] Section f5T4l and f5T3l might be of independent interest. In 
the first, a formal argument for estimate (|1.25[) is presented, inspired by Carrillo and Toscani's 
approach to the special case a = d = 1 [15] . In the second, the intermediate asymptotics of 
the unconfined gradient flow, i.e. equation (|l.ip with A = 0, are investigated. The crucial tool 
is a rescaling property for the discrete solutions of the minimizing movement scheme (|1.32j) for 
functionals satisfying ()1 . 12(1 under the action of the dilation group d r . 

2. Preliminaries 

The framework of the proofs will be that of measures on R d and the L 2 -Wasserstein distance. 
For simplicity of notation, we restrict to probability measures from now on. Due to the scaling 
invariance of (|1.1[) . it costs little generality to consider only probability densities as solutions, i.e. 
solutions of unit mass. In fact, if u(t, x) is any solution to (jl.ip of mass m > in dimension d > 1, 
then the rescaled function 

u(t,-) :=m- 1 Q d u(t,Q-) = m-%-iu(t, •), with Q ■= m (2a-i)/4+(2a-i)d 
is another solution to (jl.ip , and is of unit mass. 

2.1. Wasserstein distance. Denote by J'(R d ) (resp. J > 2(R d )) the space of the Borel probability 
measures on R d (resp. with finite second moment rri2[/i] = J Rd |x| 2 d^); further, J""(R d ) (resp. 
y r 2 (R d )) is the subset of all the measures which are absolutely continuous with respect to the 
d-dimensional Lebesgue measure L d . Convergence in [P(R d ) refers to the weak convergence of 
measures, 

(2.1) Mn-^Moo in3>(R d ) iff J cf>d^ n ^J <£d/x for all € C£(R d ). 

Given a measure /i € 3 > (R d ), its push-forward r#/j, e V{X) under a Borel map r : H d — > R d is 
defined by 

for all Borel sets A C X. When /i = uL d is absolutely continuous with respect to the Lebesgue 
measure L d and r is an injective differentiable map with non singular differential L d -&.e., then 
jj. = <C L d is concentrated on r(R d ) and its density u = &fl/&L d can be expressed through 
the formula 

(2.2) u(y) = " o r- 1 ^) for £ d -a.e. y G r(R d ). 

| det D r 

Notice that the particular choice of the dilation map r(x) = rx, r > 0, corresponds to (jl.lip . i.e. 
r#n = (rid) # ^ = (d r u)L d . 
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For measures Mo, Ml G !P 2 (R d ), denote by W 2 (moj Mi) their L 2 -Wasserstein distance 

(2.3) W 2 2 (mo,Mi) = min f \x - y\ 2 d 7 (.x, y), 

where r(/xo,Mi) is the set of all couplings between /xo and Mij i.e. all probability measures 7 on 
R d x R d whose marginals are Mo and /ii respectively. We collect in the following Theorem some 
useful properties of the Wasserstein distance, whose proof can be found, e.g., in Villani's book |4T)] . 

Theorem 2.1. (CP 2 (R d ), W2) is a complete metric space and a sequence fi n G 3 3 2(R c! ) converges 
to (Jt €. 3 3 2(R d ) if and only if it converges (|2.1|) ot J'(R d ) and rri2[ix„] — > m 2 [M]- W 2 is lower 
semicontinuous m each argument with respect to convergence in J'(R d ), and it is convex in each 
argument, i.e. for p,, /xo, Mi G 3 3 2(R d ) $ G [0, 1], one has 

W 2 (m,(1-0)mo + 0Mi) < (l-fl)Wa(/*.W)) + W 2 (M,Mi)- 

J/ /io * s absolutely continuous with respect L d , then the minimum in (|2.3|) is achieved by a unique 
coupling j op t which is induced by a transport map t = tj^J : H d — > R d , i.e. 7 opt = (id, t)#[j,o, 
Mi = t#M0; arl ^ 

(2.4) ^ 2 2 (mo,Mi)=/ |t(x) -x| 2 d Mo (x). 

jR d 

In i/ws case f/ie curve m# " ((1 — 1?) id +t9 t)^Mo / or ^ € [0, 1] provides a (constant speed, mini- 
mizing) geodesic connecting /xo and Mi, i.e. satisfying 

(2.5) W 2 (m^ ,M^) = K-^i|^ 2 (mo,Mi) /waH O , #i G [0, 1]. 

Following McCann [32], a functional U : 3> 2 (R d ) -> ( —00, +00] with proper domain Dom(U) C 
[P 2 (R ) is called displacement n-convex (or geodesically n-convex) if 

(2.6) U(Mtf) < (1- Wmo)+^(mi)-|(1-^VF 2 (mo,Mi) Vi?g[0,1], mo, Mi e Dom(U), 
where /!$ is a geodesic connecting (j,q , /ii as in (|2.5p . 

2.2. Integral functionals in IP^R"). Since the natural framework will be the space ^(R^) of 
probability measures, we will use calligraphic letters J{, 5", to denote the natural extension of the 
entropy H and the information functional F to ^(R^), with the convention that they take the 
value +00 when are evaluated on a measure fi j£ 3> 2 (R ). 
For every measure m = uL d G y^(H d ) we thus set 

(2.7) 5 a x\ji] :=F aX (u) = — [ I D(u Q )| 2 dx + - [ \x\ 2 u(x) dx, provided u a G M /1 ' 2 (R d ), 

2a J R d 2 J R d 

otherwise 5a, a [m] = +°° ■ Likewise, the associated entropy functional is given by (recalling our 
convention that m(u) = 1) 

(2.8) DC a ,x[t4~H a)A («) = -^ 7 -( / u^+VSdx-l) +^ / \x\ 2 udx if a > 1/2, 

a-l/2V7 R d / 2 ./pd 

(2.9) M 1/2iA [m] :=-ffi/a,A(«) = ©1/2 / u\ogudx+-^ \x\ 2 udx, 

provided u a+1 ^ 2 G ^(R**), or ulogu G L 1 (R d ), respectively, and !H q .a[m] — +°° otherwise. 
Notice that J£i/2,o coincides with the usual logarithmic entropy functional associated to (|1.13j) . 

Our choice A QjA = A 1/2 ((2a - l)d + 2)~ 1/2 > is motivated by Corollary l2~3l and Remark |2~T1 

Some preliminary properties of the information functional 3^, a arc collected below. Since 
M G ;P 2 (R d ) and 3a, a [m] = ^0,0 [m] + ^ m 2[ML man y properties of 1a,\ can be reduced to the 
corresponding ones of 3^,0- 
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Lemma 2.2 (Properties of the perturbed information functionals). For every a G [1/2,1] and 
A > the functional 'Ja.x is lower semicontinuous with respect to convergence in J , (R d ) ; and 
for every sequence fx n — a X J a L d with a n G W 1 ' 2 (R! 1 ), weakly converging to [i on 3 ) (R <i ) with 
sup n S^oIMh] < +oo, one has 

(2.10) (J, = a 1/a H d , a G W h2 (R d ), a n -> a strongly in L 2 (R d ) and weakly in W X ' 2 {K d ). 
^q.a is convcjQ (strictly, if ex > 1/2,), i.e. for fj,o, Mi G T 2 (R d ) and G (0, 1), one /ias 

J Q , A [(1 - 0)^0 + fyl] < (1 - &)* a ,\ M + 09a,A[M 

wwf/i equality only if fio = fj,±, or if a = 1/2. Moreover, 3^ at \[/j] is finite if and only if one of the 
following two (equivalent) conditions holds: 

(1) [i = vH d with v G W£ c 1 (R d ) and i/*" 1 ! Dv\ G L 2 (R d ); 

(jgj /it = uC d and « a +V2 e W^(R d ) and v~ l l 2 \ T>v a+l / 2 \ G L 2 (R d ). 

// i/ws is tfie case, then v G iy 1 ^(R d ) /or p = 2/(3 - 2a), v a+1 / 2 G V^ 1 ' 1 ^), and J admits the 
equivalent representations 

(2.11) y a ,oM = ^ / |D^| 2 dx^ / ^-^D^d^e 2 / ,- 1 |D^+ 1 / 2 | 2 dx. 

Proof. We start by proving lower semicontinuity of $ a ,Q- Let a sequence \i n = o~l/ a !L d of measures 
in y(R d ) with limit \x have a n G M /1,2 (R d ) and sup„ Ja.xl^n] < +oo. By Rcllich's compactness 
theorem, there exists a subsequence (still denoted by /i rl ) with a n strongly converging in L 2 oc (R d ) 
to fj G L a (R d ) as n -> +oo. It follows that = a x / a Si d , and 

liminf / |Dcr„| 2 da;>/ |Do-| 2 da;. 

Lower semicontinuity of the perturbed functional 3 a> \ (with A > 0) follows from the lower semi- 
continuity of the second moment rri2[-]. 

It follows further that the sequence a n above converges strongly to a in L 1 / Q (R d ), since 
J Rd <Jn /a dx = 1. As l/o < 2, and a n is bounded in L 2 *(R d ), with 2* = 2d/(d- 2) > 2, the 
Sobolev embedding theorem implies strong convergence of a n to a in L 2 (R rf ). 

In order to prove the second part of the statement and (|2.1ip . observe that for a — v a G 
W 1,2 (R d ) it is easily checked that both v = a 1 ^ and v a+1 / 2 = a 1+1 ' 2a belong to W^(R d ), and 
that (l)-(2) are satisfied. 

Conversely, if < e < v(x) < £ rf -a.c. in R d , then the three conditions v G W lo ' c (R d ), 
v a+i/2 g W^(H d ), and u 1 /" G W^(R d ) are equivalent, and any of them implies 

a + 1/2 

showing (|2.11|) in this case. By a standard truncation and approximation argument, this property 
extends to the general situation. 

Concerning the integrability of D v, we apply Holder's inequality to the product Dd = (v a ~ l D v)v 

/ |Dw| p da;<(/ {v a ~ x V)v) 2 Ax\ ( \ vdx] 2 . 

Next, we observe that v a ~ x Dv G L 2 (R d ) when S^aM is finite, and trivially v G L 1 (R d ). From 
here, the Sobolev embedding theorem implies that v G L p (R d ). An analogous argument, based on 
the splitting Di/* +1 / 2 =j) 1 /2(„-i/2 Dl) a+l/2j withw 1 / 2 G L 2 (R d ) and v" 1 / 2 ] D v a+1 ' 2 \ G L 2 (R d ), 
shows that v a+1 / 2 G VK 1,:L (R d ). 

It remains to prove convexity of S^a.o- To this end, observe that the function 

{x, y) G R+ x R d i * x r \y\ 2 



We recall that 3^ x is n0 * geodesically convex. 
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is jointly convex iff r = 2 (a — 1) 6 [—1, 0]; this convexity is inherited by J a ,o, thanks to the second 
representation in (|2.1ip . In fact, the map above possesses a positive definite Hessian, and hence is 
strictly convex, for — 1 < r < 0, corresponding to 1/2 < a < 1. In the borderline case a = 1, the 
strict convexity of J | D v\ 2 dx is obvious. The situation a = 1/2 has been discussed by Gianazza, 
Savare and Toscani [SJ. Finally, J a ,x differs from 7 a ,o only by -|m2[-], which is convex itself. □ 

We recall that the Wasserstein slope of a functional 9 : ^(R ) — > {— oo, +oo] is defined (as in 
any metric space) by the formula 

(2.12) \dS\(fi) :=limsup V/i G Dom(S). 

Corollary 2.3 (Fundamental entropy-information relation). For every \i € ^(R^) with!K a fi[fj] < 
+oo we have 

(2.13) J q , [m] = \dX a , Q \ 2 (fi). 
Moreover, for every A > 

(2.14) ? Q , A [/i] = \d% aiX \ 2 (fx) + (6 a - 2)A a , x M a ,\[t4- 

Proof. By [3j Theorem 10.4.6] the slope \d!K a} x\(fx) is finite if and only if /j, = vL d with u"+ 1 / 2 g 
W 1 ' 1 (R d ) and there exists a £ G £ 2 (^; R d ) such that 

(2.15) < = 9 a Di) a+1/2 ; in this case \d"K a ,x\ 2 {^) = ( |£ + Az| 2 d/z. 

In the unperturbed case A = A = 0, relation (|2.13p follows from the third representation of 3 a fl 
given in (|2.1ip . In the general case, since £ = QaV^ 1 ^ 2 D v a+1 / 2 with j Rd \£\ 2 vdx = $ a ,o[(A < +°° 
and S a — 2 = (2a — l)d, we obtain 

|9JC Q , A | 2 (/i) = / |£ + Aafvdx = f \£\ 2 vdx + 2AQ a [ x ■ Dv a+1/2 dx + A 2 m 2 [yu] 

= ?a,o [lA - 2AdO a / v a+1/2 dx + A 2 m 2 [/x] 

= S^oM - A(5 Q - 2)5C„, A M + iA 2 5 Q m 2 [/i] 
= ?'«,AM-A(<y a -2)5£ aiA M, 
where we applied the relation A 2 = X/S a and the integration by parts formula 

(2.16) / x -Bg(x)dx = -d [ g(x)dx to g = v a+1/2 £ W hl (R d ). □ 

Remark 2.1. The specific choice of A in (|2.14p is probably best understood from an alternative 
derivation of (|2.14p . which only uses relation (|2 . 13[) and the scaling invariance (|1.12p . To avoid 
technical details, we shall sketch this argument in an analogous finite-dimensional framework. 
Consider a smooth function Vo : R m — ► R which is positively homogeneous of degree —5/2 + 1. 
Hence, its squared gradient Uq(x) = |Wo(s)| 2 is homogeneous of degree —5 (homogeneity in the 
cuclidcan setting corresponds to the scaling invariance (|1.12p in the Wasserstein space). Setting 
Va(x) = Vo(x) + ^ A|a:| 2 , and observing that W (x) ■ x = (—(5/2 + l)V (a;) by homogeneity, one 
finds 

| VV A (a-)| 2 = |VV (s) + Ax| 2 = |VV (a;)| 2 + A 2 |x| 2 + 2AVV (x) • x 

= U (x) + A 2 \x\ 2 - A(S - 2)V (x) = U (x) + ^A 2 d>| 2 - A(S - 2)V A (x). 

Therefore, with A := A 2 <5 and U\(x) :— Uo(x) + ^X\x\ 2 , one arrives at the analogue of (|2.14p . 

U x (x) = |VV a (.t)| 2 + A(6 - 2)V A (x). 
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2.3. A metric characterization of the Heat/Porous medium flow. For later reference, we 
recall the "Wasserstein characterization" of the heat /porous medium equation, and its relationship 
to displacement convexity of the functionals ^K a ,\- 

The following theorem summarizes some of the results from McCann |32j . Otto [35] . Ambrosio, 
Gigli, and Savare [SJ; see also Carrillo, McCann, and Villani [T2] and Sturm [37]. For notational 
convenience, we introduce the right time derivative 

(2.17) «' + *»-«'> 

at n 

for real functions ( : [0, +oo) — > R. 

Theorem 2.4. The functional "K a ,\ is displacement A- convex, in the sense of (|2.6[) . It generates 
a k-flow in CP2(R ).' for a given initial condition /xo = uoL d G Dom(J£ ct >) C IP^R ), there exists a 
unique, locally Lipschitz curve [i : (0, +oo) — > CP2(R d ) with fi t = u t £j d E Dom^Q,^) C Dom(!K Q .,\) 
for every t > 0, which satisfies 

(2.18) \^j:Wi(nt,v) + ^WHfit^) + X a M <3f Q , A H Vi/eDom(3{ Q , A ), t > 0, 

and attains the initial condition, lim t |o Mt = Mo wi ^(R 6 *). TTie associated densities u t constitute 
a weak solution of the porous medium equation 

d t u t - 6Au" +1/2 + A div{xu t ) =0 m (0, +oo) x R d , 



with the values for O = Q > and A = A a ^\ > given in (|1.4|) . 

Remark 2.2. The case a = 1/2 and A = of the previous Theorem concerns the relative entropy 
functional 

(2.19) = 25C 1/2j0 [m] / ulogudx, ft = uL d e y r 2 (R d ). 

Up to a factor two, the Wasserstein gradient flow ji t = u t H d of "K = 23-C 1 / 2 ,o corresponds to the 
heat equation 

(2.20) dtu - Au = in (0, +oo) x R d , 

whose solution can be expressed by convolution with the initial condition, 

(2-21) u t (x) = 1 J expHz - 2/| 2 /4t) d Mo (y). 

In particular, u £ C°°((0, +oo) x R d ), and it is strictly positive. 

2.4. First variation in the Wasserstein space. As we are concerned with the gradient flow 
of GFq.a, we need to calculate its first variation in the L 2 -Wasserstein-metric. Here, we resort to 
a very direct approach to calculate the latter, which was introduced by Jordan, Kindcrlchrcr and 
Otto [27 J. The following notation is needed. For any smooth, compactly supported vector field £ 
on R d , there exists an associated smooth flow map X s : R d — ► R d , satisfying 

(2.22) ^-X s (x) =CoX s (x), X°(x)=x, 

ds 

for all x S R d and all seR. 

Lemma 2.5. Let a vector field £ € C£°(R d ) and a measure fi = uL d e T^i^) with $ a ,\\pi\ < °o 
oe given. Then the map s t— > 3" q ,a[X^./x] is differentiable at s = 0, and 



x • d/i(x) 



(2.23) —I J a , A [X^]=N Q , A [M;C]=H a [/i;C]+A / . 
dsU=o * J Rti 

where the nonlinear operator 3\f Q is defined in terms of a := w Q as 

(2.24) 3\U//;C] = — -/ faD(divC)-Dcr 2 + 2DCDo--Do-+(2a-l)(divC)|Dcr| 2N )d.T 

2a J R d V / 
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Remark 2.3 (First variation and weak formulation). Notice that the functional Na,[/z;£] corre- 
sponds to the expression N a (u; C) related to the weak formulation of fll.lj) we discussed in Remark 
11.11 in fact 

(2.25) NafaC] =N a (u;C) if A* = e T(R d ). 

Proof. It is sufficient to discuss the case A = 0. The evaluation of the first variation for the linear 
contribution can be found in e.g. |27j . 

Introduce the pushcd-forward measure and its density. 

H a = (X u s = 

and write a s = uf. Moreover, for y G R d , define the volume distortion V s (y) = det( D X s (y)) > 0. 
According to formula 



(2.26) u s (X s (y)) = u(y) ■ {V s (y)) \ a s {X s (y)) = a(y) ■ (V s (y)Y 

Changing variables x = X s (y) under the integral, we obtain 



3 a n \us\ = — / I D x cr s (a-)| 2 da; = — / I ( D x cr s ) o X'' 



(y)|V 8 fo)dy. 



Taking into account that X° is the identity, it follows 

(2.27) iL| 5 a ,o\Ps] = - f D x a(y)—| [(D x cr s ) o X s (y) V s (y) 1//2 1 dy. 
dsls=o aj R j dsls=oL x J 

Differentiation of a s in (|2.26|) with respect to y gives 

(2.28) (D x a 3 ) o (X s (y)) ■ D y X s (y) = D y a(y)V s (y)~ a - aa(y)V s (y)~ (a+1) ■ D tf V a (y). 

By multiplying both sides by the factor V s (y) 1 ^ 2 and from the right by the matrix inverse of 
D y X s (y), one obtains a representation for the expression in the square brackets in (|2.27j) . 

(2.29) 

(D x a s ) o (X s (y))V s (y) 1/2 = V;(v)- (a ~ 1/a) ( D v <r(y) - aa(y)D y (logV s (y))) ' (B y X'(y))-\ 

In order to evaluate the s-derivative of the right-hand side at s = 0, we make use of the following 
elementary identities: 

(2.30) V (y) = l, A, V.(») = divC(y), "H (D tf X'(y)) _1 =-D y C(»). 

US Is— CIS Is— 

The first identity is immediate. The second follows from the well-known relation between de- 
terminant and trace, det(l + sA) = 1 + stiA + o(s) for arbitrary square matrices A as s — ► 0. 
The last identity is a consequence of the differentiation rule for inverse matrices, d(A _1 )/ds = 
—A^ 1 ■ (dA/ds) ■ A^ 1 . Applying <|2.30[) to the time-derivative of equation ()2.29[> . one obtains 

±\ s jD x o- s (X°(y))V s (y) 1/2 ] 

= -(a- l/2)div„C(i/) B y a(y) - aa(y) B y div y £(y) - D„ a(y) ■ B y C(y). 
Inserting this into the right-hand side of (|2.27|) yields the desired result (|2.23[) . □ 



Another variation of of subsequent relevance is the one along the dilation group X s (x) 
e~ s x. A direct computation yields 

Lemma 2.6. The derivative ofJ a ^\ along the dilation group X s (x) — e~ s x is given by 

(2.31) -H 3 r a,o[Xi/x] = <5a3 r «,oN, -H ^[Xi/i] = 5 Q J a)0 -Am 2 [/x] ! 
dsls=o ^ dsls=o * 

where 5 a := (2a — l)d + 2 is the coefficient introduced in (11.41) . 
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Proof. If fi = uL d with a = u a G W 1>2 (R d ) then, recalling (pO]l . X^/x = u s £ d with u s and 
cr s = u" satisfying 

u s (x) = e ds u{e s x), a s {x) = e ads a{e s x), D a s (x) = e {ad+1)s D cj(e s 3;), 

so that a simple computation yields 

(2.32) ?a,o[X^] = e 5 ° s J Q ,o[/i], m 2 [X^] = e - 2s m 2 M. 

Differentiating ((2321) we obtain (|2~3T|) . □ 

Notice that (|2"3Tj) is consistent with (j2~2"3")h applied to = -a; = -D(|x| 2 /2), which gen- 
erates the dilation flow X s . Although £ is not compactly supported, Lemma 12.61 above can be 
derived from Lemma 12.51 by an approximation argument. Fix a smooth real function £ such that 
C(r) = 1 if r < 1 and ((r) = if r > 2 and wc take a family of smooth functions 

(2.33) C«(aO := -±\x\ 2 C(\x\/R), C R (x) =B( R (x) = -((\x\/R)x where f(r) = CW+^C'W, 
so that 

(2.34) - Ch(x) T ^M 2 , Cr(^) -x as i? T +oo Va; G R d . 
It is not difficult to check that 

(2.35) Urn N a [ti;t R ] = 7T f \V <r\ 2 dx = 5 a ? a , [»}. 

-RT+oo ZCt Jj^d 



3. Auxiliary estimates for metric gradient flows 

This section contains two essential estimates. The first inequality (|3.22|) . which we will refer to 
as "flow interchange estimate" , gives a precise meaning to the following observation: the dissipation 
of one functional along the gradient flow of another functional equals the dissipation of the second 
functional along the gradient flow of the first. This estimate will be applied repeatedly in the 
next section. The second inequality (|3.25[) provides an equilibration rate for a gradient flow whose 
potential is the dissipation of another functional along its own gradient flow. Notice that, in view 
of Corollary [2T3l the information 3 aj \ and the entropy ^K a ^\ are connected exactly in this way. 

3.L Estimates in the smooth finite dimensional case. In order to clarify the main ideas, 
we shall first derive analogous estimates in a smooth, finite dimensional setting. Suppose that 
U,V : R m — > R are smooth functions and let us consider the associated gradient flows S U ,S V : 
[0, +00) x R m — > R m . For every 11,06 R m the curves Ut = 5^(u), Vt '■= S^(v) are defined as the 
solutions of the ordinary differential equations 

(3.1) u' t = — VU(ut), v' t = — W(ut) with initial condition uq = u, vq = v. 

It is a trivial calculation to check that V is decaying along S u at precisely the same rate that 11 
is decaying along S v . A quantitative estimate can be obtained in terms of the rate of dissipation 
of U along S v : 

(3.2) D v Ulv]:=-±U(S?(v))\ t=0 . 
Lemma 3.1. The solution Ut = S^ U (w) of the XL-gradient flow satisfies 

(3.3) ^-V{u t ) + B v U[u t }=0. 

at 

Suppose moreover that V is n-convex, D 2 V > kI, with some constant k > 0, and that U can be 
written as 



(3.4) 



U(u) = I VV(u)| 2 + 2(0 - k) V(u) 
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with some 9 > 0. Then Ut converges to the unique minimum point M m in ofV (and li) as t f +oo 
with explicit exponential rates, 

(3.5) ^\u t ~ u min \ 2 < V(u t ) - V{u min ) < (V(uo) - V(u mi n)) exp(-4K0i), 

(3.6) U(u t ) - U(u min ) < (U{u ) - U(u min )) cxp ( - 4(« A 6)6 1). 
Proof. (|3.3|) follows immediately by the identity 

(3.7) D V UM = (VU(u t ), W(«t)> = Hm/i-^VK) - V(5,^K))) 

(3.8) = lhn/r^VK) - V(«t+ h )) = 

Now assume that V is K-convex, and U is given by (|3.4[) . The uniform convexity of V entails the 
existence of a unique minimizer u m i n satisfying 

(3.9) ^\u-u min \ 2 <V(u)-V(u min )<^-\DV(u)\ 2 for all u e R m . 

Z ZK 

Thanks to (|3.4[) , and since > 0, u m in is also the unique minimizer of U. A direct computation 
shows 

-B v U(u) = 2{D 2 V(u)BV(u) + (6- «)DV(u), DV(u)) > 26»|DV(u)| 2 > 4k0(V(u) - V(u min )). 

An application of Gronwall's estimate to (|3.8p yields (|3.5p . In order to show the second estimate 
l|3l)|) . define 

:= |DV(m)| 2 , Dtf(u) = 2D 2 V(u)DV(u). 
Since U satisfies ()3.4|) . it follows that 



- ^lt(ut) = |DUK)| 2 = |D*(«t) - 2 K DV(u t ) + 26BV(u t )\ 2 
at 

= |D#(« t ) - 2«DV(w t )| 2 +46* (D*(u t ) - 2kDV(u 4 ), DV(u t )) + 40 2 |DV(u t )| 2 > 40 2 *(u t ), 
where we have used that 

(D#(«t) - 2 K DVK),DV( Ut )> = 2(D 2 V( Mt )DV(u t ),DV( Mt )> - 2 K |DV(u t )| 2 > 0. 
Since, moreover, 

(3.10) U(u)-U(u min ) = y(u) + 2(6- K )(V(u)-V(u miu )) < (1 + k^QB- «)+)*(«) = —*(«), 
we eventually obtain 

(3.11) - -(UK)-U(« min )) >— (U(«t)-'lX(« I nm))=4e(«Afl)(U(« t )-U(« inin )). □ 

Remark 3.1. Estimate ()3.5p is sharp: one can simply take m = 1 and V(u) := fit 2 , so that 
U(it) = k0u 2 and Ut = uo exp(— 2ndt). The same example shows that also Q3.6p is sharp, at least 
in the case > n. 

3.2. Gradient flows in metric spaces. We shall now extend the estimates of the previous 
section to functionals U,V and their gradient flows defined in a complete metric space (X, d). 
Since the previous calculations heavily rely on the use of gradients and scalar products, it is not 
immediate how to generalize the results in the absence of a linear structure. 

n-convexity and n-flows. Let V : X — > (— oo,+oo] be a proper and lower scmicontinuous 
functional, with proper domain Dom(V) = {x £ X : V(x) < +00} . We will work under the 
additional assumption that V is geodesically K-convex: every uq,Ui £ Dom(V) can be connected 
by a (minimal, constant speed) geodesic •§ £ [0, 1] 1— ► u$ £ Dom(V) such that 

(3.12) d(u*, u n ) < |0 - 77|d(uo, ui), V(u*) < (1 - 0)V(uo) + 0V(m) - - tf)d 2 (u , ui) 
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for every 0,rj £ [0, 1]. Geodesic convexity with n > implies the existence of a unique minimizer 
Wmin for V. In fact, recall the definition of the slope from (|2.12p , 

(3.13) |0V (u) := limsup 

d(u,v) 

Then u m i n satisfies 

(3.14) ^d 2 (u, u min ) < V(tt) - V( Umin ) < ^-\dV\ 2 (u). 

A gradient flow of V, which is referred to as a K-flow, is a continuous semigroup S v £ C° (Dom(V) ; Dom 
i.e., 

(3.15) S^(S t v (u)) = Sj +h (u), \imSj, (u)=u for all u £ Dom(V), t > 0, 

/ij.0 

which satisfies the evolution variational inequality 

(3.16) ^^-d 2 (Sj(u),v) + ^d 2 (Sj(u) 7 v) + V(Sj(u)) < V(v) for all u, v £ Dom(V), t > 0. 
Equivalcntly, the K-flow of V is characterized in Ambrosio and Savare [H Thm. 5.7] by 

(3.17) \dV\ 2 (u) = lim V(M) = V(S ^ (M)) = Urn " } for all „ 6 Dom(V). 

ft,10 /l hlO h 2 

Minimizing movements. We shall now consider another lower semicontinuous functional It : 
X — > (-co, +oo] such that Dom(lX) C Dom(V). But we will not impose any particular convexity 
requirement on II. Thus, the construction of its gradient flow (which is not guaranteed to exist 
a priori) is much more difficult in general, and one cannot expect to find an appealing metric 
characterization like (|3.16p . Therefore, we shall not work on the continuous level; instead, we 
reproduce the decay estimates from the finite dimensional situation at the time-discrete level, in 
the framework of so-called minimizing movements described in p2 Chap. 2]. 

Our main assumption is that for every time step r £ (0, r ], and every U £ X, the functional 
defined on X 

(3.18) V i-» ^-d 2 {V, U) + U(V) admits a minimum point in Dom(U). 

2r 

Assumption (|3. 18|) expresses that time-discrete gradient flows of 11 can be obtained by means of 
the implicit Euler scheme. The discrete solution is constructed as follows: let a partition 3V of 
the time interval (0, +00) be given, with an associated sequence t = (t„)„ 6 n of time steps, 

(3.19) 3> T :={0 = t°<4 <•■■<£ <•••}, r n =tl-t n -\ lim t? = Vr„ = +00. 

n — >+oo * — ' 
n 

For later reference, introduce the projection of an arbitrary t > onto the partition by 

t T =rmn{t™>t\t™£'5> T }. 

Given further an initial datum U° £ X, we can recursively define a sequence (Z7™) ng N in X, such 
that [7™ minimizes the functional (|3.18|) with U := f7™ _1 , i.e. 

(3.20) ^-d 2 {U?, U™- 1 ) + U(U?) < ^-d 2 {V, f/™" 1 ) + U{V) for all V £ Dom(U). 
2r 2t 

Finally, in analogy to (|3.2p we set 

(3.21) D*U( u):= limsup MM. 

HO 

We are now in the position to formulate and prove the analogous estimates to ()3.3p . (|3.5|) and 
(133. 
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Theorem 3.2. Let (£7™) n >0 be an arbitrary solution of (|3.20p . Then, for every n > 1, the 
following "flow interchange estimate" holds: 

(3.22) V(U?) + T n D v U(U?) + ^d 2 (U?, U™~ 1 ) < V^ 1 ). 
Moreover, assume that k > 0, and that 

(3.23) U(u) = \dV\ 2 (u) + 2{6- k)V{u) for all u e Dom(U), 

with some 8 > 0. Then V and U have the same minimum point u m in> and for every n > 1, the 
following estimates hold, 

(3.24) (i + 4^t„)(v(c/; 1 ) - v(tw)) < W?' 1 ) - V(u min ), 

(3.25) (l + 49( K A6)T n )(VL(U?)~U(u min )) < UiU^ 1 ) - U(u min ). 



Before proving Theorem 13.21 we briefly comment on its applicability. Lacking any convex- 
ity property of 11, the existence of its gradient flow is not guaranteed a priori. However, the 
construction by means of (|3.20|) obviously provides a time-discrete approximation of the sought 
time-continuous flow. Denote by U T : (0, +oo) — ► Dom(U) the piecewise constant interpolant of 
the discrete values on the grid 3V (|1.31|) defined by 



(3.26) U r (t):=U? if te (t?- 1 ,**] 



A typical strategy to obtain the gradient flow of U is to pass to the limit as sup„ r n J. 0, in (a 
suitable subsequence of) the time-discrete flows U T . In order to obtain the limit by a compactness 
argument, r-independent a priori estimates for the U T are needed. Formula (|3.22p provides a 
powerful tool to derive such estimates, as will be seen in section [H where (|3.22[) is applied with 
U = "J a ,\, and various choices of V (see also chapters 2,3] for further examples). 

On the other hand, estimates (|3.24[) and (|3.25|) are used to give a quantitative description of 
the equilibration behavior of solutions to (jl.lj) in section [5] Notice that by Corollary 12. 3[ the 
functionals 3 a ,\ an( l ^a,\ are related exactly in the way (|3.23p . 

Proof. In order to prove (|3.22p . choose V :— SY(U") with ft, > in the variational inequality 
(ECU for U. This gives 

(3.27) u(u?)-u(s v h ([/;)) < ^-(d^srAK),^- 1 ) - d'iu^ur 1 ))- 

Dividing by ft > and passing to the limit as ft J, 0, the variational characterization (|3. 16|) of V, 
with t = 0, u = U? and v = U?- 1 , yields 

(3.28) D V U([/;) < T-^ViPT 1 )- V(K?) - ^d 2 (U?, U?~ x )) , 

which is estimate (|3.22p . 

Let us now suppose that U is as in (|3.23p . with k > 0. The existence of a (unique) minimum 
point it m in of V follows easily from the convexity assumption (|3.12p and the completeness of X 
(see e.g. Lemma 2.4.8]). Inequality (|3.14p and the fact that |9V|(u m i„) = show that u m - m is 
also the unique minimum point of II. 

From now on, we assume without loss of generality that V(u m i n ) = lC(it m i n ) = 0. Since, by [H 
Thm. 5.7], 

(3.29) limsup |9V|2( " } ~ gS^M] > 2 K\dV\ 2 (u) for all u G Dom(|<9V|), 

hio li- 
the second characterization (|3.17p of the re-flow induces a lower bound on D V U, 

(3.30) B v U(u) > 2n\dV\ 2 (u) + 2{6 - n)\dV\ 2 (u) = 29\dV(u)\ 2 > 40kV{u), 

where the last inequality follows from the re-convexity of V, see Q3.14p . Insert ()3.30p in (|3.22p and 
neglect the non-negative term «<i 2 (C7™, U™~ 1 )/2 to obtain estimate (|3.24p . 
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It remains to prove the last inequality (|3.25|) . Estimate the difference on the right hand side of 
(|3.27p by the triangle inequality to find 

(3.31) U(K?) - U(S£(E£)) < d(S%(U?), U?) (d(S£(K?), U^ 1 ) + d(U?, U^ 1 )) ■ 

Divide this inequality by h and pass to the limit h J, 0, then use the second variational character- 
ization (|3.17|) of S v to identify the limit on the right hand side. In combination with p.30p . we 
obtain the two-sided estimate 

(3.32) 29r n \dV\ 2 (U?) < r n D v U([/; 1 ) < \W\(U?)d(U?, U^ 1 ). 
This leads further to 

(3.33) 20r n |0V|(K?) < d(U?, U?' 1 ), 46(9 A k)t*U(U?) < d 2 (U?, U^ 1 ). 

For the derivation of the second estimate above, we have made use of (recall U(u m i n ) = 0) 

(3.34) U(u) < °-^\dV\ 2 (u), 

K 

which follows in the same way as (|3.9|) in the finite-dimensional case, simply using (|3.23[) instead 
of JS3J), and ([3H|) instead of f3J0|) . 

Direct substitution of (|3.33|) into the characterization (|3.20p of the time-discrete flow for U, 
with V := f/™ -1 , gives a bound similar to (|3.25p but with the smaller constant 26(9 A k). A more 
refined estimate is needed to recover the better constant. 

To this end, introduce the so-called De Giorgi's variational intcrpolants: for r G (0, r n ] we define 
Z7™' r a minimizcr of the functional V h-> ^d 2 (V, U^' 1 ) +U(V) , thus obtaining a path joining {/" 
with {/" (which can be chosen equal to £/™' T "). Replacing r„ with r, and C/™ with [/™' r , estimates 
([3331) and monotonicity of IX, i.e. U(U^ r ) > 11(17?), see [1 Lemma 3.1.2], yield 



(3.35) 46>(6»AK)r 2 lI([/™) < 46(9 A n)r 2 U(U^ r ) < d 2 (U^ r , U^ 1 ) Vre(0,r„]. 

The crucial inequality satisfied by U^ r is provided by p2 Thm. 3.1.4], which shows that the map 
r i ► d(L^< r , t/™- 1 ) is Borel and 

(3.36) l[([/^) + -^d 2 ([/;,L7;- 1 )+ ^d 2 (U^ r ^U?- 1 ) dr < U^ 1 ). 

Inserting ([3331) and ([3~3"5)) into (pHS]) we eventually get |3~23)l . □ 

The following remark shows that the discrete estimates (|3.24j) and (|3.25j) give rise to equilibra- 
tion estimates analogous to (|3.5jl and ()3.6|) . 



Remark 3.2. Let (A™)„>o be a sequence of non-negative numbers, and let At be its piecewise 
constant interpolant, taking the value A™ in the interval (£™ -1 ,i™]. For a given time t > we set 
t T := inf{i£ : > t, k G N}. Assume that 

(3.37) (1 + ct„)A^ < A';' 1 for all n G N and some c > 0. 
Then, with c T (t) := r^ 1 log(l + cr„) for £ G (£™ _ \ £"], we get 

(3.38) A T (t) < A T (s)exp(^- J c x (r)dr) V<> s > 0. 

If A T is uniformly bounded, Holly's Theorem shows that any sequence Tk of partitions admits a 
subsequence (still denoted by Tk) and a limit function A such that lim^oo A Tk (t) = A(t) for every 
£ > 0. If limfcjoo t Tk = £ for every £ G [0, +oo) then we eventually get 

(3.39) A(t) < A(s) cxp ( - c(t - s)) . 
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4. Existence of Solutions 

The existence proof follows the ideas first developed by Jordan, Kinderlehrer and Otto [57] and 
later generalized in Ambrosio, Gigli and Savare [3]. First, a family of time-discrete approximative 
solutions is constructed by means of the variational minimizing movement scheme, which has 
been introduced in section T3.2I above. The decay of the information 3 a ,\ is immediate from the 
construction and leads to a priori estimates. The latter provide weak compactness of the set of 
discrete solutions, and thus allows to conclude the existence of a time-continuous limit curve. 

In order to show that the limit curve constitutes a weak solution to the gradient flow, we need 
to pass to the limit in the non-linear discrete equation. Additional compactness is needed for this 
step. A sufficient a priori estimate is obtained by evaluating the dissipation of the logarithmic 
entropy along the discrete flow. 

The procedure is thus similar to the one developed in Gianazza, Savare and Toscani [24] . 
However, we will follow a different strategy to derive the a priori estimate and to prove the 
convergence of the scheme. 

4.1. The semi-discrete scheme. For the reader's convenience, we review the main steps of 
the semi-discretization procedure in a series of lemmas. For any further details, we refer to the 
exhaustive treatment in [3]. Given a partition TV of [0, +oo) induced by the sequence of time- 
steps r = (r n ) nS N as in (|3.19p . and an initial measure M° € 2*2 (R d ), we consider the sequence 
(M™) ne N recursively defined by solving the following variational problem in D>2(R ): 

(4.1) find G < J > 2 (R d ) which minimizes M h-> -^-W 2 2 (M™ _1 , M) + 1 a \[M). 

The existence and uniqueness of a minimizer follows by standard methods from the calculus of 
variations, employing the continuity and convexity properties of W 2 and 5" Qj a collected in Theorem 
CTand Lemma O 

Lemma 4.1 (Basic discrete estimates). For each choice of t and M° G y 2 (R d ), there exists a 
unique sequence (M™)„ s n C CP 2 '(R d ) solving (|4.1j) . The information functionals S^^fM"] are 
finite and monotone non-increasing with respect to n, 

(4.2) j QiA [m;] < j QiA [a/;- 1 ] < j q , a \m% 

Moreover, the quadratic moments and Wasserstein distances between measures at consecutive time 
steps satisfy for every N > 

00 / H/ ( A/T n A/f n ~^ U 2 

(4.3) VJ 2[ ' T ' T ' ) <2? atX [M% sup m 2 K]<2m 2 [M T °]+<J t> . A [M T ]. 

, V T n I Kn<N 

n—1 — — 

Proof. By the minimality property of Af™, one has 

(4.4) -J—W$ (Af™ _1 , M ™) + J q .a[m;] < J^fM™- 1 ], 

2r n 



which induces (|4.2p and the first inequality of ()4.3p by summation over n = 1, 2, . . .. In order to 
prove the second bound, we sum up the triangular inequalities 



\/m 2 [M«] < ^[jwr 1 ] + w 2 (m™ , m;~ 1 ) , 

from n = 1 to n — N , observing that 

N N 1/2 N 111 



n—1 n—1 n—1 

The traditional way to proceed from here is to introduce the discrete velocity vector fields. 
However, for our existence proof, we shall not pursue that line of argument. Nonetheless, we 
briefly recall the definition and basic properties of the velocity field for the sake of completeness, 
and also since it provides a natural interpretation of the gradient flow structure. 

As each M™ is absolutely continuous, Theorem 12.11 guarantees the existence and the (M ™- 
essential) uniqueness of a Borel map t™ : R rf -> R d with (t") # M™ = M^" 1 , realizing the optimal 
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transport from M™ to M™ 1 in the definition of the Wasserstein distance (|2.3|) . Introduce the 
discrete velocity vector field v™ by 

(4.5) v?(a:):=i(x-t»(x)). 

For each n = 1, 2, . . the v" satisfies the nonlinear characterization 



(4.6) / C(*)-v?(a)dM?(s)=X a ,A[M?;C] 

for all test vector fields C G CcfCR'*, Rd )> with ^a.A as defined in ([Z2g]) . As showed by [27j . 
the relation (|4. 6[) follows by taking the first variation of the minimizing functional (|4.ip at the 
minimum point M" along the flow S s (/i) := X^/x, obtained by taking the push-forward of a 
measure /i under the maps (|2.22p induced by the vector field £. 

The discrete solution thus satisfies a system composed of the non-linear constraint (|4.6p , and a 
transport equation, which follows directly from the definition ()4.5|) and reads in weak form as 

(4.7) / <t>(x)d(M?{x) - M?-\x)) = [ ^(x)-^{x-T n w n T (x)))dM^{x). 

jR d JR d 

The system (|4.6[) - (|4.7[) is a canonical starting point for studying the continuous time limit. 

However, we shall take another approach which does not make use of the velocity vector fields. 
Instead, we will derive the weak form of the gradient flow equations in the continuous time limit 
by means of the next lemma. 

Lemma 4.2 (Discrete time derivative). Let a test function £ € C X3 (R d ) be given, which satisfies 
-\k\I < D 2 C, < \k\I for some k<0. Then 
(4.« 



jR d JR d 1 



)■ 



Moreover, the second moment satisfies 

(4.9) (I + 2Ar„)m 2 [M?] + W^M^M?" 1 ) = m 2 [M^ 1 } + 2S a T n ? a , [M?}. 

Notice that (|4.8p is a discrete local version of the weak formulation (|I ,23p for the gradient flow 
of 3 a> \- This can be seen after division of both sides by t„ > 0, and taking into account that we 
expect W^(M™, M™ _1 ) = o(r„) (at least in an integral sense) in the limit r„ J, 0. 

Proof. Choosing £ := — D £, the semi-group Sj coincides with the k- Wasserstein gradient flow of 
the functional V(n) := f Rd (dn [51 Example 11.2.2]. We apply the "flow interchange estimate" 
(j3~22"l) to U := J a , A , where 

D v l%) = -K Q , A [/i;C] = X a , A [/i;DC] 

by Lemma [2.51 Substituting V with —V yields (|4.8p . The particular choices £ := iK|x| 2 with 
K = ±1 correspond to V(/i) = ±^rri2[/x] in Theorem 13.21 and yield (|4.9p via Lemma [2~B1 □ 



4.2. An a priori estimate related to entropy dissipation. The classical estimates 
on the discrete scheme are sufficient to prove the existence of continuous limits of M T and v T as 
sup |r n | — > 0. An additional a priori estimate is needed to verify that these limits indeed satisfy 
the desired nonlinear evolution equation (|1.23p . Below, we obtain a sufficient estimate from the 
decay of the logarithmic entropy "K — 2IK 1 / 2 ,o along the discrete flow. 
To simplify notations, we introduce 

(4.10) /?:=—, 

a 

which is in one-to-one correspondence to a, and varies between (3 = for a = 1, and /3 = 1 for 
a = 1/2. 
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Theorem 4.3. Let M" = U™ L d be any solution of the discrete variational scheme (|4.1j) . and 
define = (U?) a . Then U? e W 2 ^{H d ) and a™ € W 2 ' 2 (R d ) satisfy for every n > 1 

(4.11) c r„ / ||D 2 ^fdx< XiMf-^-HlMfl+dXTn, where Co :=-( I -^j-r^p). 

J R d 1111 a V d(a + 2) I 

Proof. The expression on the left-hand side of (|4.11|) constitutes a lower bound on the dissipation 
rate of along the gradient flow of 5" ai A- In the spirit of Theorem l3.2[ the roles of "K and 3^a,\ will 
be interchanged: we calculate the dissipation of the information 3^. a along the Wasscrstein gradi- 
ent flow of 3i, which corresponds to the classical heat equation (see Remark l2.2p . The regularizing 
effect of the heat flow makes it comparatively easy to justify the necessary manipulations. 

Apply the "flow interchange estimate" (|3.22|) with the choice V := !H (which is geodesically 
convex, thus k = 0) and 11 := 3^. a- Then (|4.11| follows once we have shown that 

(4.12) D v U(m)>c / II D 2 crll 2 — cZA if fx = uL d , a = u a . 

Jn d 

This is the content of Lemma 14.41 below. □ 



A remark concerning the choice of the logarithmic entropy IK as a Lyapunov functional is due 
at this point. First, it is a convenient choice since the resulting entropy dissipation has the right 
homogeneity to provide W 1,2 -compactness for a easily. Second, it is a canonical choice since the 
heat flow generated by 'K dissipates every functional 'J : 3 (R d ) — > R that is jointly convex in u 
and Dm; so in particular it dissipates y a ,\ for 1/2 < a < 1. 

Lemma 4.4. Let fiQ = uo£j d € Dom(3") and let p, s = u s L d be the associated solution to the heat 
equation (|2.20p on R rf , which is given by (|2.2ip . If the right derivative liminfhjo tK^.aIm] — 
3 a ,\\ph\) i s finite, then a — u$ e M /2,2 (R rf ) and 

d + 1 f 2 

(4.13) - —3 r a .x[f-s}\_ n = liminf r (3 r Q ,A[Mo] - 3q. a[/^]) > c / 1 1 D 2 cr 1 1 dx - dX. 

as |s ~ u hio h x ' J Rd " " 

Remark 4.1. We emphasize that the dimension-dependent prefactor of [3 in the definition of Co 
is always less than one, and converges to one for d — ► oo. Hence, for < (3 < 1 — corresponding 
to \ < a < 1 — the coefficient Co is positive. Further, observe that in the case a = 1, it follows 
a = u, and (|4.13|) reduces for A = to the well known estimate 



d+ 1 
cL72 



Vu s \ 2 dx = [ |Au | 2 d:c = / |D 2 u | 2 da;. 

jR d JR d 



Proof. Since 5" q ,aM = 3q,oM + i m 2M and 

(4.14) -T-m 2 [Mfl] = v- / \x\ 2 dp s = -2[ x-Bu s dx^ i 2d[ u s dx = 2d, 
ds ds J Rd J Rd J Rd 

thanks to u s 6 W 1 ' 1 (R d ), we can simply consider the case of A = and evaluate the time derivative 
of5- Q ,o[/i s ] = ^/ Rt! |D<| 2 d.T. 

By standard parabolic theory, u s is a C°°-smooth, strictly positive probability density for each 
s > 0. Moreover, for every s > there exists a constant C s such that for all s > s andp 6 [1, +oo], 

(4.15) \\ u s\\Lp(n d ) + II D u s|lLp(R d ; R d ) + II D 2 u. s || £P(R d. R dxd) < C s - 

We stress that this classical estimate holds with any probability density uq as initial condition, 
thanks to the representation (|2.21[) of solutions to the heat equation. For e > we consider the 
smooth real functions 

(4.16) f E (r) := (e 1/a + r) a - s, re[0,+oo), 

whose first and second derivatives arc uniformly bounded; recall that 1/2 < a < 1. Then the func- 
tions p £ , s := f e (u s ) € C°°(R d ) satisfy bounds analogous to (|4.15p (with constants also depending 
on e > 0) and the evolution equation 

(4.17) d sPe = Ap^Pie + p^lDp^ 2 , 
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where B has been defined in (|4.10|1 . It follows that for every test function Q £ C^°(R d ), 

-T-T-/ \VPs,s\ 2 (dx = -- f Dp s -d s Dp e , s Cdx 
2a ds J Rd a J R c 

= - f Ap e (Ap £ + !3(e + p s )- 1 \Dp s \ 2 )(dx + - f (A Pe + 0(e + p^l B p £ \ 2 ) Dp £ • D(dx 

We integrate the previous inequality with respect to s from arbitrary points < s\ < S2 and we 
choose a family of cutoff functions ((x) = ( n {%) '■= Co(x2 - ™), where < Co < 1 an d Co(x) = 1 
if |x| < 1. The contribution of last integral vanishes in the limit as n — > +oo, since D£„(x) = 
2- n DCo(£2-"). Consequently, 

(4.18) -J-A / |Dp £ , s | 2 dx = i / ((Ap E ) 2 + BA Pe (e + p^V Pe \ 2 ) dx 

la ds j-Rd a J R d V / 

(4.19) = i J ((A Pe ) 2 + 4/3 Ap £ | D z £ | 2 ) dx. 

where z £ := (e + fe) 1 ^ 2 — e 1 ^ 2 satisfies 2Dz £ = (e + p^ 1 / 2 D p E . 
For further estimation, we recall that 

(4.20) = / ||D 2 p £ || 2 dx- / (Ap E ) 2 dx, 

and the following integration by parts rule (see [3H Theorem 3.1] for a proof), 

(4.21) 0=/ div(|Dz £ | 2 Dp £ )dx = / (Ap e \ D z e ? + 2 D 2 p s D z e ■ D z £ - 4| D zJ 4 ) dx, 
Introducing the parameters 

(A 00\ r Mf3 1 ( 8/3 r r -L. 1 A X ( d ~ ^ 

(4 ' 22) Cl = -^T^ = ato- 4/3 > C2 = Co + ^T^ = al 1 + W^> 

we add ci times the equality (|4.2ip and C2 times the equality (|4.20p to the dissipation relation 
dUIS]). This gives 

-is/ B j D ^^-»/ R j' D2 ^^ + ^2)L{w- i »ii D2 ' > -ii 3 -V( a ^ 

(4.23) - 8dD 2 p e Dz £ • D z £ + 8 Ap £ |D z £ | 2 + 16d |D z £ | 4 1 dx. 



As a final ingredient, we need the following elementary estimate for the error term in the Cauchy- 
Schwarz inequality (see e.g. [23 Lemma 2.1] for a proof). 

Lemma 4.5. Let A = (ay) &e a real symmetric d x d-matrix, and let ^ e £ H d be an arbitrary 
vector. Then the square-norm ||A|| 2 = ^ a 2 ^ emd ifte irace trA = ^2 an of A satisfy 

(4.24) (d- 1)||A|| 2 - ^(trA) 2 > -J^ (d A e • e - trA |e| 2 ) 2 . 

Lemma 14.51 is applied with A = D 2 p £ and e = D z e to estimate the norm of the Hessian inside 
the second integral in (|4.23p . A straight-forward calculation reveals 

(4.25) / |Dp £ | 2 dx>c / ||D 2 p £ || 2 dx 
2a ds J Rd J Rd 

dB f /D 2 p £ Dz, • D z e 1 , , 9 \ 2 

(4-26) + ^ / ( 1 ^ _ 8 , a -- 3 Ap £ -4|Dz £ | 2 dx. 



a{d + 2)J Rd \ |Dz £ | 2 d 

In view of B > 0, the last term is non-negative. Integration of (|4.25p from s = 0tos = /i>0 thus 
provides 

A/ |Dp E)ft | 2 dx + c / / ||D 2 p £ , s || 2 dxd S < A f |Dp £i0 | 2 dx 
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By definition of p E from u by means of f e given in (|4.16|) . it follows, 

Using that a < 1, we conclude by definition of 7 a ,o, 

-L/ |D Pe , | 2 d.* = f / - 1/ J D ; / °' 2 N2n ^ <J a , M. 
2a / R d 2 7 Rt! (jxq + e i/«)2(i-«) 

Moreover, it clearly follows that u £ , s L d — > /i s = p s £ d in the sense of 3 3 (R d ) as e J, 0, pointwise in 
s G [0, h}. Hence, by lower semi-continuity of 9^,0 and of the W^ ' -semi-norm, 

f h 

^a,o\Mh] + c 



/ / \\B 2 p h \\ 2 dxds < J a ,oH 
Jo Jn d 



After addition of the contribution due to the confinement potential, and division by h > 0, the 
limit h I provides the desired estimate (|4.13[) . □ 

4.3. Passage to the continuous time limit. To conclude the construction of the discrete 
approximation, we introduce interpolants of and C7" for all times t > 0: denote by M T , U T , 
and t,-, respectively, the right-continuous piecewise constant functions with 

M T (i) := M; 1 = U?Z d , U T {t) := £/", t T := t n T for t G (t^T 1 , i?] . 

Moreover, introduce accordingly ov = (C/ X ) Q . 

We recall the notion of generalized minimizing movement introduced by De Giorgi [17] (and 
here adapted to the case of a scheme with variable time steps) : 

Definition 4.6 (Generalized minimizing movements). For a given partition 7 T , the function 
M T : [0,+oo) -> ? 2 (R d ) obtained by piecewise constant interpolation of the solutions M™ of the 
minimization scheme (|4.ip are called discrete solutions. 

We say that a curve fi G C°([0, +oo); ^(R )) is a generalized minimizing movement /or 
iw'i/i initial datum fio G J ) 2(R ti ) */ i/iere errisf 
^ a sequence of partitions wi£/i limj ; _ ) . 00 sup {r„ : J^ILi T i — ^} = / or ever?/ T > 0; 

(iij sequence of initial data M® converging to /iq in 3>2(R d ) with !K at \[M® ] — > 3^ Qi a[mo] aM <^ 

(raj a corresponding sequence of discrete solutions M Tk {t) such that 

(4.27) M Tk {t)^n{t) in ^(R^) Vi > 0. 

FFe denote by GMM{$ a% \)iiQ) the collection of all the generalized minimizing movements. 

Concerning assumption (ii), we remark that convergence of 3a. a implies convergence of J{. a ,\ if 
/i G Dom(3 r Qj A). 

The next theorem is the main statement of this section and in particular yields Theorem 1 1.1 1 in 
the case fin G Dom(9 r Q: A)- 

Theorem 4.7. Assume that fin G Dom(3 r Q!i A). For every sequence of partitions Tk and discrete 
solutions M Tk {t) whose initial data M Tk (0) = M° fc satisfy conditions (i) and (ii) of the previous 
definition, there exists a subsequence (still denoted by Tk) such that as k —> oo, 

(4.28) M Tk = U Tk £j d — > (J, = uL d in 3 3 2(R d ); uniformly in each compact interval [0,T], 

(4.29) U Tk {t)^u{t) m J L a+1/2 (R d )nL 1 (R d ) Vt > 0, 

(4.30) a Tk -> a strongly m L 2 ((0,T);W h2 (R d )), and weakly in L 2 ((0,T);W 2 ' 2 (R d )), VT > 0, 

(4.31) ? a! x[M Tk (t)\ - 9 a ,x\n(t)] for tf-a.e. t > 0. 

The limit u is a solution of in the weak form (|1.20|l . (|1.21|1 . /n particular, any generalized 

minimizing movement \i = w£ d G GMMfJa^;^) is a weak solution of m £/iis sense. 
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Proof. The proof is naturally divided into various steps. 

Step 1: Compactness. We prove that there exists a suitable subsequence (still denoted by Tk) such 
that for every t, T > 

(4.32) M Tk (t) = U Tk {t)L d -> fi(t) = u(t)H d in?(R d ), a Tk (t) -> <j(t) strongly in L 2 (R d ), 

(4.33) a Tk {t) -> a(t) weakly in W 1,2 (R d ), a Tk -> tr weakly in L 2 ((0, T); FF 2,2 (R d )). 

Pointwise weak convergence in J"(R d ) follows immediately from (|4. 3[) and a simple extension of 
Ascoli-Arzela compactness Theorem (see e.g. [3 Prop. 3.3.1]), which in particular implies 

(4.34) W 2 (M r (t),M T {s)) < (2IF Q , A [M°]) 1/2 ■ (t T - s^) 1 ' 2 VO < s < s T < t < t T . 

Strong (resp. weak) convergence in L 2 (R d ) (resp. in W 1,2 (R d )) then follows by the properties of 
3a,A stated in Lemma |2~2"1 and the uniform bound ^^[M™] < ^[M"], sec P~2"]) . 

An explicit bound of the quadratic moment, even in the case A = 0, can be easily obtained 
from (|4.9[) . which gives 

(4.35) m 2 [M x (t)] < m 2 [ M o] + 2<5 Q t T ? ai o(M°). 

Finally, weak convergence in L 2 ((0,T);W 2 ' 2 (R d )) is a consequence of the uniform bound 

(4.36) c / / || D 2 cr T || 2 d^dt < M[M°] + Trm 2 [AI°] + (d\ + 2nS a 3 : a . [AI°])(T + t) VT > 0. 

Jo Jn d 

Here and below, r := supfr^ | i € N} controls the time step size in the given partition, and 
inequality (|4.36p is obtained by summing the central a priori estimate ([4. lip for a T with the 
bound (|4.9p on the second moment, taking into account (see e.g. [531 Sect. 2.3]) that 

(4.37) + 7rm 2 [M] > V/x e T 2 (R d )- 

Step 2: Discrete equation. We prove that for every < s < s T = < t < t x = and £ G 
C£°(R d ) ra'i/i -k/ < D 2 C < one /ias 



(4.38) / £dM T (t)- f CdM T (s)+ / ' W a , A [Af r (r);D(]dr 

jR d JR d Js T 

Summing up (|4.8p from n = m + 1 to n = p one gets 



<«rJ a ,JM°] 



[ t&Ml- ( (dM? + 2 n,N„,A[M T (r);DC] E ^W^ 1 )- 

7Rd n=m+l 1 n=m+l 

Claim (|4.38p above then follows recalling (|4.3p . 

Step 3: Continuous limit equation. We prove for the limit curve fi of step 1, that for every < 
s < t and ( e Co°(K d ), one has 

(4.39) / CMt)~ [ Cd/i(s)+ f H Q , A [M(r);DC]dr = 0, 

and that the Lebesgue density u(t, •) of /i(t) satisfies the weak formulation (|1.23p and (|1.2ip of the 
gradient flow (jl.ip . 

Equation (|4.39|) above follows by passing to the limit in (|4.38|) as k j oo and applying Step 
1. Notice that weak convergence of a Tk to a in £ 2 ((0, T); W 2 ' 2 (R. d )) and strong convergence in 
L 2 ((0,T); L 2 (R d )) immediately yields strong convergence in £ 2 ((0, T); W 1 ' 2 (R d )) and, up to the 
extraction of a further subsequence, (|4.3ip . Further, observe that the map a \— > is 
continuous and quadratic in Vt^ 1,2 (R d ). Hence, one can pass to the limit in the integral term 

X^>r a ,A[M T (r);DC] dr. 

It is then a standard fact that (|4.39p implies that a satisfies the weak formulation Q1.23P . Finally, 
the other weak formulation (|1.2ip can be deduced integrating by parts, which is admissible since 
a = u a e L 2 OC ([0, +oo); W 2 > 2 (R d )); see RemarkO 

Step 4: Convergence of quadratic moments. We prove that for every t > 0, 

(4.40) lim m 2 [M Tk (t)\ = m 2 [/i(t)] so that M Tk {t) -> fi(t) m T 2 (R d ). 
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Insert the family of test functions — introduced in (|2.33|) — in formula (|4.39|) above. Recalling 
that € T2 (R d ) and arguing like in the derivation of (|2.35[) . we get 

(4.41) m 2 [v(T)]+2\ [ m 2 [/x(r)] dr = m 2 [mo] + 26 a [ 3 a , [fi(r)] dr 



On the other hand, summing up estimate (|4.9|) from n — 1 to TV yields 

(4.42) m 2 [M T (T)] + 2 A f m 2 [M T (r)] dr < m 2 [fx ] + 25 a [ ^ [M T (r)] dr 

Jo Jo 

The strong convergence of ay^ to cr in L 2 OC ((0, +oo); M^ 1 ' 2 (R d )) permits passage to the limit in the 
right-hand side of (|4.42p as k | oo. Convergence of M Th (t) in CP(R d ), and the lower semicontinuity 
of the quadratic moment yield for all T > 0, 



limsupm 2 [M nt (T)] < m 2 [n(T)} + 2\( f 

feT + oo \Jo 



m 2 [n(r)} dr - liminf / m 2 [M Tfc (r)] dr < m 2 [/i(T)]. 

fc T + oo Jo / 

This implies pointwise convergence (and a posteriori also locally uniform convergence, see (|4.34p ) 
of M Th {t) in 3> 2 (R d ). 

Step 5: Convergence in L a+1 / 2 (R d ). We prove that the densities U T converge strongly in L a+1 / 2 . 
In view of the previously established convergence results, is obviously sufficient to show that the 
norms ||£/i-(*)||.L<*+i/2(R<i) converge to their respective limits ||it(i)|| it »+i/2( R d). Observe that these 
norms and the (unperturbed) entropies are equivalent, 

\\UAt)\\ a L tU% = ^^W a , [M T {t)] + 1. 

By the convergence of M Tk in J' 2 (R d ) and the uniform boundedness of S^o, 

(4.43) |5£a,o[Af TJb (t)] -5C a , [A*(t)]| < W 2 (M Tk (t), fi(t))(? afi (M Tk (t)) + ? a , Q (ji(t))) 1/2 

converges to zero, and so does the difference of the respective L Q+1 / 2 (R d )-norms. 

Thus, the proof of Theorem 14.71 is complete. □ 



To complete the proof of Theorem 1 1.1) it remains to treat the case in which the initial condition 
Mo has finite logarithmic entropy H, but infinite information. 

Theorem 4.8 (Regularizing effect). Suppose /j,q = uo£j d <£. Dom(3 r at \) but (|1.19|) is satisfied, 
and the discrete initial data M® = U®L d converging to /io in 7' 2 (R d ) satisfy !K[M°J — ► H[uq] < 
+oo. Then the conclusions of Theorem \4- 7| remain valid even in the absence of hypothesis (ii) of 
Definition \4-6[ except that (|4.28|) and (|4.29|) may fail at the endpoint t = 0. 

Proof. A combination of Sobolcv inequalities and standard interpolation results yields 

(4.44) Ve>03Q e >0: || D a\\ 2 L2(Rd) < e || D 2 a\\ 2 L2{Rd) + Q e \\af LUa{m 

for every function a G IF 2 ' 2 (R d ) n L 1/a (R d ); recall that \ja < 2. 

Multiply the bound (|4. 9(1 on moments by 27r, and take the sum with the a priori estimate (|4.11[) . 

•K\M^\ + 27rm 2 [Af; 1 ] + c r n || D 2 < + 27rm 2 [A/?- 1 ] 

+ r„ ( Xd + 2oT 1 n6 a \ | D cr™ 1 1 \ 2 (Rd; 

Choosing e > in estimate (|4.44|) such that 2a~ 1 Tr5 a s — Cq/2, and observing that ||cr™|| L i/ Q( - R d) = 
1, we obtain 

(4.45) !K[M™] + 27rm 2 [M™] + ^r„|| D 2 a?|| 2 2{Rd) < ^[M^ 1 ] + 2nm 2 {M?- 1 } + Cr n , 
where C := Xd + 2a~ 1 ir5 a Q e . In particular, in view of inequality (|4.37|) this implies the estimate 

(4.46) ^ f ||D 2 a T (i)|| 2 2(Rt!) dt < M[M°]+2nm 2 [M°} + CT T . 
2 Jo 



26 



DANIEL MATTHES, ROBERT J. MCCANN, AND GIUSEPPE SAVARE 



Now combine the fact that n i— » J a ,\\M™] is nonincreasing with the moment estimate (|4.42|) and 
inequality (|4.46p above to deduce both a pointwise and an integral bound for the functional Ja.x'- 

T7 a ,x[M T {T)]< [ ? a , x [M T (t)]dt 
Jo 

< (1 + SJ2) [ T J a , [MAt)} dt + ]m 2 [M°] 

Jo 4 

< fj (4 D 2 ^(t)||| 2(Rd) + Q e ) dt + im 2 [M°] 

< d(H[M°] + 27rm 2 [M°]) + C 2 (1 + T T ). 

With these estimates at hand, we can repeat the arguments given in the proof of Theorem 14. 71 on 
each interval [eh, +oo), for a vanishing sequence £h > 0. A standard diagonal argument concludes 
the proof. □ 



5. Convergence to Equilibrium 

The large time behavior of the previously constructed weak solutions u(t) to (|l.ip is described as 
follows. For a positive confinement force A > 0, the solutions converge to a Barenblatt profile 6 a ,A> 
which is the common minimizcr of the entropy H a .\ and the information F a ^\, in L 1 (R d ) at the 
exponential rate of A; see section 1531 below. In addition to the convergence estimates in L 1 (R d ). 
we obtain estimates for the equilibration of the information F at \. For a = 1, this translates into 
a convergence estimate for u(t) in W 1,2 (R, d ); see section l5~2l 

For vanishing confinement, no minimizer of F a $ exists, and solutions converge weakly to zero. 
However, the solutions become asymptotically self-similar: they converge in L 1 (R d ) after a suitable 
rescaling. The limit is again a Barenblatt profile 6 Qj a- The rescaling arguments are spelled out in 
section 15.31 

To close the discussion, we provide an alternative (but largely formal) argument that leads to 
L 1 (R d ) convergence in section [S~4l The argument extends the idea developed by Carrillo and 
Toscani |15j for the confined thin film equation (a = 1 and A = 1) in d = 1 dimension. 

5.1. Equilibration of the confined gradient flow. Throughout this section, we assume posi- 
tive confinement A > 0. 

In order to study the equilibration of solutions to (|l.lj) . we shall employ the estimates (|3.24|) 
and (|3.38|) that have been derived in the general metric framework in section [3] At the very basis 
of our argument is the intimate relation (jl.lOp between the entropy J{ a ,\ and the information 
&a, a- We recall their definitions, 




And, according to Corollary 12. 3[ the information can be expressed in terms of the entropy as 
follows, 

(5.2) J QiA [/i] = |cW Q , A | 2 ( M ) + Ok - 2)AJt a , A [/i]. 

Thus, is connected to !K QiA in precisely the same way that U is connected to V in the equation 
(|3.23p considered in section O see ()5.7|) below for the exact correspondences. 

Since A > 0, also A = A q A > 0. Among the probability densities on R d , there exists exactly one 
minimizer /3 q ,a € of 0i a , a, and it is also the unique minimizer of 3a, a- This follows immediately 
from Theorem 13.21 For a E (1/2, 1], the minimizer (3 a _\ = b a ,\L d is given by a Barenblatt profile 
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(ll.Ga.p of mass m = 1 , which naturally coincides with the unique stationary solution of the slow 
diffusion equation (|f .3a|) . With the coefficients defined as above, this profile takes the form 

(5.3) 6 a , A (x) = (a - bM 2 ); /(Q - 1/2) , b = ^Z^A, so that D (^-b^ + ± |*| 2 ) = 0. 

V 2a Act — 1 z 

Here a > is a parameter chosen to adjust the mass of b a ^\ to one. For a = 1/2, the minimizcr 
0i x is a Gaussian measure. 

To simplify calculations, we shall use normalized versions of entropy and information, which 
are non- negative and vanish exactly in the equilibrium state of p.l|) . Introduce accordingly 

Since !H a ,\ is geodesically A-convex in ^(R^), it satisfies the bounds 

(5-4) ^WlWa.A) < & Q ,aM < T^-I^.aI 2 ^). 

As Otto and Villani note, these bounds generalize the logarithmic Sobolcv and Talagrand trans- 
portation inequalities |36j . 

The equilibration estimate (|I.25[) is a consequence of the following exponential decay of the 
entropy. 

Theorem 5.1. Under the assumption of a positive confinement strength A > 7 any solution fj,(t) 
to problem p.l[) (|l .20 constructed in Section^ satisfies 

(5.5) | Wi((i(t),p a , x ) < £«,a[m(*)] < cxp(-2At)M ct , A [^o], 

(5.6) < exp(-2At) J a , A [Mo]- 

Proof. Since J{ ai A is geodesically A-convex in 3 2(R <1 ), it generates a A-flow (see Theorem 12. 4j) . 
Because satisfies ()5.2|) . wc can apply Theorem l3.2l in the complete metric space X := ^(R^) 
with 

(5.7) V:=M Q , A , U:=5* aiA , k := A, 0:=-A5>/c, 4k6> = 2A. 
By ()3.24[) the discrete solutions M™ introduced in section 14.11 satisfy 

(5.8) (1 + 2At„)A q ,a[M^] < MiM?- 1 }, (1 + 2\T n )? a , x [M?} < ^[M?" 1 ]. 

Recalling Remark 13. 2[ the lower semicontinuity of ^Ha^JajA) and the pointwise convergence of 
M T to /i in J> 2 (R d ) we conclude both (J53|) and (J5J>1). □ 

To conclude estimate (|1.25[) of Theorem II. 2 [ we invoke a Csiszar-Kullback inequality. The latter 
allows to estimate the Z^-distance of a given probability density u to the Barenblatt profile b a \ 
in terms of the normalized entropy 3i Q; A- The following result can be found in [TH Theorem 30]. 

Lemma 5.2. There exists a positive constant c Qj a only depending on a and A such that for every 
measure fi = utl d G ^(R^) 

(5-9) \\u - &a,A||£i(R<i) < C Q ,A M Q:A [/i] 1/2 . 

Clearly, (|1.25|) is obtained in combination of (|5.9p with (|5.5[) . 

5.2. Equilibration in W 1,2 (R. d ) for the thin-film equation. In the special case a = 1 of the 
thin film equation, the exponential convergence of the information functionals has the consequence 
that the solution u(t) converges to its steady state in W 1 ' 2 (R d ), also exponentially fast. 

Theorem 5.3. Assume a = 1 and A > 0. Then there exists a constant (only depending on 
A and d) such that any weak solution u(t) constructed in the proof of Theorem \l.l \ satisfies 

(5-10) \\u(t) - &i,A||^i,2 (R d) < C\,dFi,\( u o) exp(-2At). 
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Above, the stationary state 

bi,\(x) = —— — ^r(p 2 — \ x \ 2 ) 2 ,, P = Px > chosen to adjust the mass to unity, 
is the Smyth-Hill profile. 

Proof. For a given probability density u, define the linear segment ug = 8u + (1 — #)&i.a for 
< 9 < 1. Since b\.\ is the minimizcr of F\.\, 

(5.11) fuM-fuN>o 

for all 6 G [0, 1]. Divide (j5~TTj) by 6» > and calculate the limit 6 J. 0, finding 
/ (D«-D6i, A ) ■Dfei, A da; + ^ / |x| 2 (u - 6 M ) dx > 0. 

jR d ^ JR d 

In view of the definition of Fx,\, it follows easily that 

(5.12) I f |D U -D6 1 , A | 2 dx<^ 1 , A (u)-F liA (6 1 , A )=y 1)A [M]-Ji )A [A,A]=yi,AM- 
The information decay ()1.27p thus implies 

(5.13) \\T>u(t)-Db hX \\ 2 L2(Rd) < 2F 1 , A (H )e X p(-2At). 

Interpolation of this estimate with the equilibration in L 1 from (|1.25|) provides the L 2 -estimate 

(5.14) \\u(t) - b ltX \\ 2 L2(Ri) < H hX (u ) exp ( - 2Xt). 

Since A > and § > \d3i\ 2 > 2AJC, the claim (jlUOf follows by combination of ([5713]) with 
dOH . □ 

5.3. Equilibration in the absence of confinement. We shall now consider the gradient flow 
(|l.ip in the limit of vanishing confinement, A = 0. In fact, one may argue that the most natural 
gradient flow to study in this context is not (jl.ip associated to the functional F a: x, but the one 
for the unperturbed information F a fi. The corresponding equation reads 

(5.15) d t w = - div (w D [/^Ara"] ) . 

Since the unconfined information functional $ a ,o enjoys the scaling property (|1.12[) . solutions w 
to its gradient flow can be related to solutions u of a confined gradient flow as follows. 

Lemma 5.4. A function u G L 2 3C ((0, T); IF 2,2 (R d )) is a weak solution of (|1.1[) for A = 1 if and 
only if the function 

(5.16) w{t,-) = ^ m u{\ogR{t),-) = R(t)- d u{\ogR{t),R{t)- 1 -), R(t) = (l + (6 a + 2)t) 1,{Sa+2 \ 
is a weak solution of (|5.15p . 

The proof follows by straight-forward calculations. 

An immediate consequence of Lemma l5.4l is that equation (|5 . 1 5[) possesses self-similar solutions. 
One is defined by 

(5.17) b a , (t, •) := R{ty d b^Rit)' 1 ■ ) = * R(t )b a ,i(-), 



where b a ^\ is the Barenblatt profile from (|5.3|) with unit confinement strength, and the rescaling 
factor is given by 

(5.18) R(t) = (l + (S a + 2)t) 1/{Sa+2) . 



Another consequence of Lemma 15.41 are the intermediate asymptotics of solutions w. In the limit 
t — > oo, any solution w to (|5.15p converges to zero in Lj oc (R d ), and does so by approaching the 
self-similar solution from (|5.17|) . 
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Corollary 5.5. Let w be generalized minimizing movement induced by 3^,0 (and a fortiori a 
weak solution to equation (|5.15[) ). Then w approaches the self-similar solution 6 Q .o from (|5.17[) as 
follows, 

(5.19) \\w(t, ■) - b afi (t, -)|| Ll(Rd) < C ■ R(t)-\ 

where C > depends only on H at i(u>o) only. 

We remark that estimate (j5.19[) is a global estimate, which provides little useful information 
locally. Indeed, on any bounded set M C R d , the L 1 (M)-norm of w*(t) obviously converges to 
zero at rate R(t)~ d , which is at least as fast as the approximation of w*(t) by w(t) at rate ii(t) -1 . 
In order to obtain meaningful local rates, refined asymptotics would need to be studied, as has 
been done for the related slow diffusion equation with a > 1/2 and d = 1 by Angenent [I], and 
with a < 1/2 but d > 1 by Dcnzlcr, Koch and McCann [3U] [H]. Globally, however, ([STT5]) is a 
non-trivial statement, by its equivalence to ()1.25p . 

The goal is to obtain Corollary 15.51 as a consequence of estimate (jl.25j) of Theorem 11.21 The 
obstacle is that (|1.25| has not been proven for general weak solutions to (jl.ip . but just for those, 
which are obtained by means of the minimizing movement variational scheme. Hence, it remains to 
be shown that generalized minimizing movements for the rescaled and for the unrescaled equation 
are in correspondence. To this end, we prove the following (generalized) discrete counterpart of 
Lemma [ 



Theorem 5.6. Assume that a functional U : Ip2(R d ) - * ( — oo, +oo] satisfies 

(5.20) U(d Rf i) = R~ 5 U(n) V/i G y 2 (R d ), R > 0; Hr/j, = (R id) # M- 

Let M £ 2*2 (R ), t > and R > S > be given. A measure M € Dom(U) is a minimizer of the 
functional 

(5.21) p.-#( W M,T,A) := ^WiQi, M) + lC(/x) + ^m 2 [ M ] 
if and only if QrM minimizes 

$(//;J s M,f,A) = ^=W2Qi,T>sM) + U(ji) + ^m 2 [tA, with 

(5.22) 2r 2 

tR 

In particular, the result applies to U = & a ,a with 5 = 5 a . When comparing discrete solutions 
to p.ip and (|5 . 1 5[) , respectively, R represents the rescaling factor of the current time step, and S 
that of the previous one; Theorem 15.61 is applied with A = 0, so that R = (1 + Xt)S. 

Proof. By (|5.20[) and the rescaling property of the Wasserstein distance, 

(5.23) Wi(d s ^d s iy) = S 2 W^{^,v) V/i, v e 3 2 (R d ), 

it is sufficient to prove the theorem in the case 5=1, when (|5.22p corresponds to 

(5.24) $( W M,f,A) = ^W 2 2 ( M ,M)+l[(M) + ^m 2 M ! with t:=tR s +\ X := ~~Yr~~- 
Let us then introduce the functional W : Dom(lI.) x (0, +oo) — > R 

(5.25) *(/i,r) := ^ W|(D r /Lt, M) + r rU(/i) + ^r(l + Ar — r)m 2 [/i]. 
Observe that 

(5.26) T- 1 vI/( M ,l) = $( /i; M, T; A), (ri? 5 + 1 )" 1 *(M,-R) = $(i) iiM ;M,f,A) 

with f , A given by (|5.24|) . For fixed fj, G 3> 2 , the expression ^(/i, •) is linear in its second argument; 
in fact, if 7 G 3 3 2 (R d x R d ) is an optimal coupling in T(fj,,M), then (i?id,id)#7 is an optimal 
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coupling in r(0fl^, M). Hence, 

Wi[VrH,M)= I \rx-y\ 2 d 1 {x,y) = f (r 2 \x\ 2 - 2rx ■ y + \y\ 2 ) dj(x, y) 

Jn d x'R d JR d xR d v 7 



2 m 2 [lj]~2r x ■ydj + m 2 [M], 

JR d xR d 



and 



*(M,r)=r(- / x-yd 1 + TU( f i) + ^ 7 ^m 2 [fi}) + \m 2 [M}. 

V JR d xR d Z II 

Differentiating with respect to r we get 

£*(/i,r) = -/ a;-2/d7 + TU(^) + i^-^m2[M] 

<? r jR d xR d 1 

= \w 2 {»,M) + rU(fi) + ^-mM - \m 2 [M] = 1) - \m 2 [M]. 

If M is a minimizer of (|5.21|) . then it is also a minimizer of ^(-, 1) and of d r ^(-, r). It follows that 
(5.27) V{M,r) < *(/i,r) Vr > 1, /i e Dom(lt). 

In view of the second relation in (|5.26p . it follows that X)rM is a minimizer of (|5.22[) . The same 
argument shows that if M is a minimizer in (|5.22p . then >P(5 fi -iM, 1) coincides with fy(M, 1), and 
therefore 5 B -iM is a minimizer in (|5.2ip . □ 

By means of the previous theorem, we conclude that generalized minimizing movements indeed 
enjoy the same rescaling property stated in Lemma [ 



Corollary 5.7. A continuous curve [i € C°([0,+oo);7 2 CR, d )) belongs to GMM^a^]^) iff the 
rescaled curve u(t) = D/j( t )^(log R(t)) belongs to GMM{'J a fl, /io). 

Proof. Let M T be a discrete solution associated to the functional jF^i for a given partition 7 T . 
Define a sequence of discrete rescaling S"™ inductively by 

on cm — 1 

(5.28) S° T := 1, S" = (1 + tvOS™- 1 i.e. T — = S™" 1 . 

We remark that this is the explicit Euler scheme for solution of the differential equation 

(5.29) ±S(t) = S(t) 

on the partition 3V- Correspondingly, we introduce the new family of partitions 3^, induced by 

n 

(5.30) := TnS?- 1 (S") 1+ S s™ := £ r? TO . 

m— 1 

Further, we define the piecewise linear time rescaling map L T : [0,oo) — > [0,co) by 

(5.31) L r {%) := s™ L x is linear in [t? _ \t?]. 

By definition, L' T (t) = S^ 1 (S^) 5a+1 in (t™" 1 ,^). Employing Theorem EU with S := S 1 ; 1 " 1 and 
ii := 5", it is immediate to check that the sequence M™ defined as 

(5.32) M£ :=?><;„ 

is a discrete solution associated to the unperturbed functional J a .Q with respect to the rescaled 
partition rj. The corresponding piecewise constant interpolant M n satisfies 

(5.33) M n (L T (t))=a Sr(t) M r (f) > 

with 5V being the piecewise constant interpolant of the 5*™ with respect to the partition 3V • 

By definition, fi G GAfM(5' Q , j A; /io) implies that there exists a sequence of partitions Tk such 
that M Tk — > /i locally uniformly in 3^2 (R^)- Moreover, SV^ (t) — ► e* locally uniformly and LV fc (t) — * 
L(i) = 1 + (S a + 2) _1 e^ Q+2 ^ by (|5.30|) . Thus, up to the extraction of a further subsequence, we 
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find that -A-/^ converges locally uniformly in 3 3 2(R <i ) to some limit curve v. Again, by definition of 
the generalized minimizing movements, v £ GMM(J tti o; (Xq). Passage to the limit in (|5.33[) yields 

(5.34) v{L(t)) =Q e t[x(t) yt>0. 

Consequently, u{s) = O fi ( s )/z(log R(s)), since L(t) — s means that t = logR(s). The reverse 
implication can be proved in a similar way. □ 

Corollary 15.71 provides the sought correspondence between generalized minimizing movements 
for equation fll.lj) and for equation (|5.15j) . Corollary 15.51 thus becomes a direct consequence of 
estimate (|1.25j) from Theorem 1 1.2 1 



5.4. An alternative approach to entropy decay. The following constitutes an alternative 
approach to the equilibrium estimate (|5.5p . This alternative approach is purely formal in the range 
a > 1/2, where solutions are not generally smooth, but might be expected to apply rigorously 
to the (upper end of) the range a £ [(d — 2)/ 2d, 1/2], where examples suggest broader classes of 
solutions will be smooth (and possess rapid spatial decay). We include it here since it provides 
an enlightening view on the very particular structure of the gradient flow (|1.1|) . The key idea is 
to rewrite the equation ()1 . 1(1 in a way which is inspired by Carrillo and Toscani |15| and allows us 
to perform completely explicitly the estimates that resulted in an abstract way from the theory 
of gradient flows in the previous section. 

Assume that u is a smooth and strictly positive solution to (|1.1[) . which decays rapidly enough 
at for |x| — » oo to justify all subsequent integration by parts. A tedious but straight-forward 
calculation reveals that u satisfies 

(5.35) 9 tU = div( u DG«. A ) \ P A{u a+1 l 2 AG a +)-—^— P D 2 : (u Q+1 / 2 D 2 G Q . A ) 

2a + 1 2a + 1 

with the notation 

G a ,x =qu a 1/2 + -M 2 , p = V2a-, q=- --, 

2 A 2a — 1 A 

and A is as in (|5.1[) . Notice that the representation for the one-dimensional thin film flow used in 
[T5] is a special case of the general formula (|5 .35[) above. 
Next, rewrite the definition of 3i a ^\ in the form 

M ^M = f/(^fi72«° +V2 + ^)^-^ 

It is straight-forward to check that 

|^ Q , A | 2 = (j) 2 J u\DG a , x \ 2 dx. 
On the other hand, the time derivative of the relative entropy along a solution becomes 



A d f 
---J( a ,A[/j(t)] = - / G Q ,\d t udx 
A at j R d 



it I D G a a 1 2 dec 



2a -1 



[ u a + 1 / 2 \AG a , X \ 2 dx+-^—p[ U " +1 /2|| D 2 GQA ||2 d ^ 

J R d 2a + 1 J Rd 



2a +1 J Rl 

Neglect the contribution from the last two terms, which are non-negative for a > 1/2. (In fact, 
their sum is non-negative in the range a > (d — 2)/2d where IHc^Im] is Wa-geodesicaJly convex.) 
Instead, relate the first term of the entropy production to the entropy itself, observing that 

-jK a , x \}i{t)] > jJ^u\DG a , x \ 2 dx= ^|cW Q , A | 2 > (2A)K a , A , 
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employing the convexity estimat^ (|5.4p . One arrives again at the exponential decay estimate 
(|5.5p , and thus at an alternative proof of Theorem 11.21 Currently — lacking estimates on higher 
regularity of the solution u — it is, however, unclear how to turn this formal argument into a 
rigorous proof. 
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